VectorCalculus 


SIXTH EDITION 


Jerrold E. Marsden 


Anthony Tromba 


VectorCalculus 


SIXTH ED DITION 


A nthony Tromba 
University of California, Santa Cruz 


W. H. Freeman and Company - New York 


Publisher: Ruth Baruth 

Executive Editor: Terri Ward 

Executive Marketing Manager: Jennifer Somerville 
Associate Editor: Katrina Wilhelm 

Senior Media Editor: Laura Capuano 

Editorial Assistant: Tyler Holzer 

Project Editor: Vivien Weiss 

Art Director: Diana Blume 

Director of Production: Ellen Cash 

Illustration Coordinator: Bill Page 

Illustrations: Network Graphics 

Photo Editor: Ted Szczepanski 

Compositor: MPS Limited, a Macmillan Company 
Manufacturer: Quad Graphics 

Cover Image: Robert Wilson 


Politics is for the moment. 
An equation is for eternity. 


A. EINSTEIN 


Some calculus tricks are quite easy. 

Some are enormously difficult. The fools 

who write the textbooks of 

advanced mathematics seldom take the trouble 
to show you how easy the easy calculations are. 


SILVANUS P. THOMPSON, CALCULUS MADE Easy, MACMILLAN (1910) 


Library of Congress Control Number: 2011931725 


ISBN-13: 978-1-4292- 1508-4 
ISBN-10: 1-4292-1508-9 


©2012, 2003, 1996, 1988, 1981, 1976 by W. H. Freeman and Company 
All rights reserved. 

Printed in the United States of America 

First printing 

W. H. Freeman and Company Publishers 

41 Madison Avenue 

New York, NY 10010 


Houndmills, Basingstoke RG21 6XS, England 
www.whfreeman.com 


Contents 


Preface ix 

Acknowledgements xi 

Historical Introduction: A Brief Account xiii 
Prerequisites and Notation xxiii 


1 The Geometry of Euclidean Space 1 


1.1 Vectors in Two- and Three-Dimensional Soace 1 
1.2 Tne Inner Product, Length, and Distance 19 

1.3 Matrices, Determinants, and the Cross Product 31 
1.4 Cylindrical and Spherical Coordinates 52 

1.5 nm-Dimensional Euclidean Space 60 


Review Exercises for Chapter 1 70 


2 Differentiation 75 


2.1 The Geometry of Real-Valued Functions 76 
2.2 Limits and Continuity 88 

2.3 Differentiation 105 

2.4 Introduction to Paths and Curves 116 

2.5 Properties of the Derivative 124 

2.6 Gradients and Directional Derivatives 135 


Review Exercises for Chapter2 144 


3 Higher-Order Derivatives: 


Maxima and Minima 149 


3.1 Iterated Partial Derivatives 150 

3.2 Taylor's Theorem 158 

3.3 Extrema of Real-Valued Functions 166 

3.4 Constrained Extrema and Lagrange Multipliers 185 

3.5 The Implicit Function Theorem (Optional) 203 
Review Exercises for Chapter 3 211 


Contents 


4 Vector-Valued Functions 217 


4.1 Acceleration and Newton's Second Law 217 
4.2 ArcLlLength 228 
4.3 Vector Fields 236 
4.4 Divergence and Curl 245 
Review Exercises for Chapter 4 260 


5 Double and Triple Integrals 263 


5.1 Introduction 263 

5.2 The Double Integral Over a Rectangle 271 

5.3 The Double Integral Over More General Regions 
5.4 Changing the Order of Integration 289 

5.5 The Triple Integral 294 


Review Exercises for Chapter5 304 


6 The Change of Variables Formula and 


Applications of Integration 307 


283 


6.1 The Geometry of Maps from R? to R? 308 
6.2 The Change of Variables Theorem 314 
6.3 Applications 329 

6.4 Improper Integrals (Optional) 339 


Review Exercises for Chapter 6 347 


7 Integrals Over Paths and Surfaces 351 


7.1 The Path Integral 351 

7.2 Line Integrals 358 

7.3 Parametrized Surfaces 375 

7.4 Area ofa Surface 383 

7.5 Integrals of Scalar Functions Over Surfaces 393 
7.6 Surface Integrals of Vector Fields 400 


7.7 Applications to Differential Geometry, Physics, 
and Forms of Life 413 


Review Exercises for Chapter 7 423 


Contents 


8 The Integral Theorems of Vector Analysis 427 


8.1 Green's Theorem 428 
8.2 Stokes’ Theorem 439 
8.3. Conservative Fields 453 
8.4 Gauss’ Theorem 461 
8.5 Differential Forms 476 


Review Exercises for Chapter 8 490 


Answers to Odd-Numbered Exercises 493 
Index 533 
Photo Credits 545 


this page left intentionally blank 


To Jerrold E. Marsden, 1942-2010 


Jerry Marsden, Carl F. Braun distinguished Professor at the 
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Preface 


This text is intended for a one-semester course in the calculus of functions of several 
variables and vector analysis, which is normally taught at the sophomore level. In addi- 
tion to making changes and improvements throughout the text, we have also attempted 
to convey a sense of excitement, relevance, and importance of the subject matter. 


Prerequisites 


Sometimes courses in vector calculus are preceded by a first course in linear algebra, 
but this is not an essential prerequisite. We require only the bare rudiments of matrix 
algebra, and the necessary concepts are developed in the text. If this course is preceded 
by a course in linear algebra, the instructor will have no difficulty enhancing the material. 
However, we do assume a knowledge of the fundamentals of one-variable calculus—the 
process of differentiation and integration and their geometric and physical meaning as 
well as a knowledge of the standard functions, such as the trigonometric and exponential 
functions. 


The Role of Theory 


The text includes much of the basic theory as well as many concrete examples and 
problems. Some of the technical proofs for theorems in Chapters 2 and 5 are given 
in optional sections that are readily available on the Book Companion Web Site at 
www.whfreeman.com/marsdenvc6e (see the description on the next page). Section 2.2, 
on limits and continuity, is designed to be treated lightly and is deliberately brief. More 
sophisticated theoretical topics, such as compactness and delicate proofs in integration 
theory, have been omitted, because they usually belong to a more advanced course in 
real analysis. 


Concrete and Student-Oriented 


Computational skills and intuitive understanding are important at this level, and we 
have tried to meet this need by making the book concrete and student-oriented. For 
example, although we formulate the definition of the derivative correctly, it is done 
by using matrices of partial derivatives rather than abstract linear transformations. We 
also include a number of physical illustrations such as fluid mechanics, gravitation, 
and electromagnetic theory, and from economics as well, although knowledge of these 
subjects is not assumed. 


Order of Topics 


A special feature of the text is the early introduction of vector fields, divergence, and curl 
in Chapter 4, before integration. Vector analysis often suffers in a course of this type, 
and the present arrangement is designed to offset this tendency. To go even further, one 
might consider teaching Chapter 3 (Taylor’s theorems, maxima and minima, Lagrange 
multipliers) after Chapter 8 (the integral theorems of vector analysis). 


New to This Edition 


This sixth edition was completely redesigned, but retains and improves on the balance 
between theory, applications, optional material, and historical notes that was present in 
earlier editions. 
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Preface 


We are excited about this new edition of Vector Calculus, especially the inclusion of 
many new exercises and examples. The exercises have been graded from less difficult 
to more difficult, allowing instructors to have more flexibility in assigning practice 
problems. The modern redesign emphasizes the pedagogical features, making the text 
more concise, student-friendly, and accessible. The quality of the art work has been 
significantly improved, especially for the crucial three-dimensional figures, to better 
reflect key concepts to students. We have also trimmed some of the historical material, 
making it more relevant to the mathematics under discussion. Finally, we have moved 
some of the more difficult discussions in the fifth edition—such as those on Conservation 
Laws, the derivation of Euler’s Equation of a Perfect Fluid, and a discussion of the Heat 
Equation—to the Book Companion Web Site. We hope that the reader will be equally 
pleased. 


Supplements 


The following electronic and print supplements are available with Vector Calculus, Sixth 
Edition: 


1. Book Companion Web Site. www.whfreeman.com/marsdenvc6e The Book 
Companion Web Site contains the following materials: 


e Additional Content contains additional material suitable for projects as well as 
technical proofs and sample examinations with complete solutions. Also 
included are discussions of the second derivative test for constrained extrema, a 
look at Kepler’s laws and the solution to the two-body problem, a further 
discussion of Feynman’s view of The Principle of Least Action and of how cats 
fall and astronauts reorient themselves in space, a look at some further 
differential equations in Mechanics, and an examination of Green’s function 
methods in partial differential equations. 


e PowerPoint and KeyNote Slides for instructors to use in presentations of the 
text’s figures, as well as section-by-section summaries. 


° BIFX and PDF Files of Sample Exams (on instructor’s password-protected site) 


2. Student Study Guide with Solutions. This student guide contains helpful hints 
and summaries for the material in each section, and the solutions to selected 
problems. Problems whose solutions appear in the Student Study Guide have a 
colored number in the text for easy reference. The guide has been revised and reset 
for the Sixth Edition of Vector Calculus. 


3. Instructor’s Manual with Solutions. This supplement contains material available 
only to instructors. This includes summaries of material and additional worked-out 
examples that are helpful in the preparation of lectures. It also contains additional 
solutions to problems and sample exams (some of them with complete solutions). 


4. Final Exam Questions. There are practice exams available on the Book 
Companion Web Site as well as in the Instructor’s Manual. The level and choice of 
topics and the lengths of final exams will vary from instructor to instructor. Working 
these problems requires a knowledge of most of the main material of the book, 
and solving 10 of these problems should take the reader about 3 hours to complete. 


Jerry Marsden and Tony Tromba, 
Caltech and UC Santa Cruz, Summer 2010. 
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Historical Introduction: 
A Brief Account 


This, therefore, is Mathematics; she reminds you of the invisible form of the soul; she 
gives life to her own discoveries; she awakens the mind and purifies the intellect; she 
brings light to our intrinsic ideas; she abolishes oblivion and ignorance which is ours 


by birth. —Proclus, c. 450 


Cum Deus Calculat Fit Mundus. 


(As God calculates, so the world is created). —Leibniz, c. 1700 


The word mathematics derives from the Greek word mathema, meaning knowledge, 
cognition, understanding, or perception, suggesting that the study of what we now call 
mathematics began by asking questions about the world. In fact, the historical evidence 
suggests that mathematics began about 2700 years ago as an attempt to comprehend 
nature. Unfortunately, in most mathematical expositions, historical motivations and 
contexts are often sacrificed. In this new edition, the authors continue to address this 
problem by including the discussion of historical and contextual material where appro- 
priate. Therefore, before we dive into the mathematics of Vector Calculus, we briefly 
discuss the development of mathematics prior to and including the discovery of calculus. 


Egyptian, Babylonian, and Greek Mathematics 


It is generally acknowledged that mathematics developed in the seventh and sixth cen- 
turies B.C., somewhat after the Greeks had developed a uniform alphabet. This is not to 
say, however, that mathematical knowledge did not exist before the Greeks. In fact, 
the Egyptians and Babylonians knew many empirical facts centuries before the rise of 
the Greek civilization. For example, they could solve quadratic equations, compute the 
areas of certain geometric figures, such as squares, rectangles, and triangles, and they 
possessed a reasonably good formula for the area of a circle, using the value of 3.16 for 
x. They also knew how to compute certain volumes like the size of cubes, rectangles, 
rectangular solids, cones, cylinders, and (not surprisingly) pyramids. The ancients were 
also acquainted with the Pythagorean theorem (at least empirically). 

The Greeks, who settled throughout the Mediterranean, must have played an 
important role in preserving and spreading the mathematical knowledge of the Egyp- 
tians and the Babylonians. However, the Greeks were aware that there were different 
formulas for the same area or volumes. For example, the Babylonians had one formula 
for the volume of a frustum of a pyramid with a square base, and the Egyptians had 
another (see Figure 1). 

It is not surprising that the Egyptians (with the experience in pyramid construc- 
tion) had the correct formula. Now, given two formulas, it was clear that only one could 
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be correct. But how could one decide such an answer? Certainly it is not a question 
for debate, as would be the question of the quality of works of art. It is likely that the 
necessity to determine the answers to such questions is what led to the development of 
mathematical proof and to the method of deductive reasoning. 

The person usually credited for the invention of rigorous mathematical proof 
was a merchant named Thales of Miletus (624-548 B.c.). It is Thales who is said 
to be the creator of Greek geometry, and it was this geometry (earth measure) as an 
abstract mathematical theory (rather than a collection of empirical facts) supported by 
rigorous deductive proofs that was one of the turning points of scientific thinking. It led 
to the creation of the first mathematical model for physical phenomena. 

For example, one of the most beautiful geometric theories developed during 
antiquity was that of conic sections. See Figure 2. 

Conics include the straight line, circle, ellipse, parabola, and hyperbola. Their discov- 
ery is attributed to Menaechmus, a member of the school of the great Greek philosopher 
Plato. Plato, a student of Socrates, founded his school The Academy (see Figure 3) in a sa- 
cred area of the ancient city of Athens, called Hekadameia (after the hero Hekademos). 
All later academies obtained their name from this institution, which existed without 
interruption for about 1000 years until it was dissolved by the Roman Emperor 
Justinian in 529 C.E. 


figure 2 The conic sections: (A) hyperbola, 
(B) parabola, (C) ellipse, (D) circle. 


figure 3 Plato's Academy 
(mosaic found in Pompeii, Villa 
of T. Siminius Stephanus, 

86 x 85 cm, Naples, 
Archaeological Museum). With 
certainty the seven men have 
been identified as Plato (third 
from the left) and six other 
philosophers, who are talking 
about the universe, the celestial 
spheres, and the stars. The 
mosaic shows Plato’s Academy, 
with the city of Athens in the 
background. It is probably a 
copy (from the first century B.C.) 
of a Hellenistic painting. 
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Plato suggested the following problem to his students: 
Explain the motion of the heavenly bodies by some geometrical theory. 


Why was this a question of interest and puzzlement for the Greeks? Observed 
from the Earth, these motions appear to be quite complicated. The motions of the 
sun and the moon can be roughly described as circular with constant speed, but the de- 
viations from the circular orbit were troublesome to the Greeks and they felt challenged 
to find an explanation for these irregularities. The observed orbits of the planets are 
even more complicated, because as they go through a revolution, they appear to reverse 
direction several times. 

The Greeks sought to understand this apparently wild motion by means of their 
geometry. Eudoxus, Hipparchus, and then Apollonius of Perga (262-190 B.C.) suggested 
that the celestial orbits could be explained by combinations of circular motion (that is, 
through the construction of curves called epicycles traced out by circles moving on 
other circles). This idea was to become the most important astronomical theory of 
the next two thousand years. This theory, known by us through the writings of the 
Greek astronomer Ptolemy of Alexandria, ultimately becomes known as the “Ptolemaic 
theory.” See Figures 4 and 5. 

Most of Greek geometry was codified by Euclid in his Elements (of Mathematics). 
Actually the Elements consist of thirteen books, in which Euclid collected most of 
the mathematical knowledge of his age (circa 300 B.C.), transforming it into a lucid, 
logically developed masterpiece. In addition to the Elements, some of Euclid’s other 
writings were also handed down to us, including his Optics and the Catoptrica (theory 
of mirrors). 

The success of Greek mathematics had a profound effect on views of nature. The 
Platonists, or followers of Plato, distinguished between the world of ideas and the world 
of physical objects. Plato was the first to propose that ultimate truth or understanding 


xvi 


Historical Introduction 


cares) 
2 ee 


<< 


figure 4 Woodcut from Georg von Peurbach’s 
Theoricae novae planetarum, edited by Oronce 
Fine as a teaching text for the University of Paris 
(1515). It was the canonical description of the 
heavens until the end of the sixteenth century, 
and even Copernicus was to a large extent 
under the influence of this work. Peurbach 
described the solid sphere representations of 
Ptolemaic planetary models, which he probably 
based on Ibn al-Haytham’s work “On the 
configuration of the world” (translated into Latin 
in the thirteenth century). The same frontispiece 
was used for the Sacrosbosco edition of the first 
four books of Euclid’s Elements (in excerpts), 
which appeared under the title Textus de 
Sphaera in Paris (1521). 


figure 5 Ptolemy observing the stars with 

a quadrant, together with an allegoric 
Astronomia. (From Gregorius Reish, Margarita 
Philosophica nova, Strasbourg, 1512, an early 
compendium of philosophy and science.) In 
those days, Ptolemy was often depicted as a 
king, because he was erroneously thought to be 
descended from the Ptolemaic dynasty that 
ruled Egypt after Alexander. 


could not come from the material world, which is constantly subject to change, but only 
from mathematical models or constructs. Thus, infallible knowledge could be attained 
only through mathematics. Plato not only wished to use mathematics in the study of 
nature, but he actually went so far as to attempt to substitute mathematics for nature. 
For Plato, reality lies only within the realm of ideas, especially mathematical ideas. 
Not everyone in antiquity agreed with this point of view. Aristotle, a student of Plato, 
criticized Plato’s reduction of science to the study of mathematics. Aristotle thought that 
the study of the material world was one’s primary source of reality. Despite Aristotle’s 
critique, the view that mathematical laws governed the universe took a firm hold on 
classical thought. The search for the mathematical laws of nature was underway. 
After the death of Archimedes in 212 B.C., Greek civilization went into a period of 
slow decline. The final blow to Greek civilization came in 640 A.D. with the Moslem 


figure 7 Detail from the Codex 
Vigilanus (976 A.D. northern 
Spain). The first known 
occurrence of the nine Indo- 
Arabic numerals in Western 
Europe. (Escurial Library, Madrid.) 
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conquest of Egypt. The remaining Greek texts housed in the great library in Alexandria 
were burned. Those scholars who survived migrated to Constantinople (now part of 
Turkey), which had become the capitol of the Eastern Roman Empire. It was in this 
great city that what survived of Greek civilization was preserved for its rediscovery by 
European civilization some five hundred years later. 


Indian and Arabian Mathematics 


Mathematical activity did not, however, cease with the decline of Greek civilization. In 
the middle of the sixth century, somewhere in the Ganges Valley in India, our modern 
system of numeration evolved. The Indians developed a number system based on ten, 
with ten rather abstract symbols from zero to nine looking “roughly” as they do today. 
They developed rules for addition, multiplication, and division (as we have today), a 
system infinitely superior to the Roman abacus, which was used (by a special class 
of servants called arithmeticians) throughout Europe until the fifteenth century. See 
Figure 6. 

After the fall of Egypt, came the rise of Arab civilization centered in Baghdad. Schol- 
ars from Constantinople and India were invited to study and to share their knowledge. 
It was through these contacts that the Arabs came to acquire the learning of the ancients 
as well as the newly discovered Indian system of numeration. See Figure 7. 


figure 6 Arithmetician performing a calculation 
on a counter-abacus. 
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It was the Arabs who gave us the name Algebra, which comes from the book by 
the astronomer Mohammed ibn Musa al-Khuwarizmi titled “Al-Jabr w’al muqabola,” 
which means “restoring” or “balancing” (equations). Al-Khuwarizmi is also responsible 
for a second profoundly influential book entitled “Kitab al jami’ wa’l tafriq bi hisab al 
hind” (Indian Technique of Addition and Subtraction), which described and clarified 
the Indian decimal place value system. 

The decline of Arab civilization coincided with the rise of European civilization. 
The dawn of the modern age began when Richard the Lionhearted reached the walls 
of Jerusalem. From approximately 1192 through around 1270, the Christian knights 
brought the learning of the “infidels” back to Europe. Around 1200-1205, Leonardo of 
Pisa (also known as Fibonacci), who had traveled extensively in Africa and Asia Minor, 
wrote his interpretation (in Latin) of Arabic and Greek mathematics. His historic texts 
brought the work of al-Khuwarizmi and Euclid to the attention of a large audience in 
Europe. 


European Mathematics 


Around 1450 Johann Gutenberg invented the printing press with movable type. This, 
combined with the advent of linen and cotton paper obtained from the Chinese, dramat- 
ically increased the rate of the dissemination of knowledge. The steep rise in trade and 
manufacturing fueled the growth of wealth and dramatic change in European societies 
from feudal to city-states. In Italy, the mother of the Renaissance, we see the rise of 
extraordinarily wealthy states such as Venice under the Doges and Florence under the 
Medicis. 

The needs of the rising merchant class accelerated the adoption of the Indian system 
of numeration. The teachings of the Catholic Church, which rested on absolute authority 
and dogma, began to be challenged by the ideas of Plato. From Plato, scholars learned that 
the world was rational and could be understood, and that the means of understanding 
nature was through mathematics. But this sharply contradicted the teachings of the 
church, which taught that God designed the universe. The only possible resolution of 
this apparent contradiction was that “God designed the universe mathematically” or that 
“God is a mathematician.” 

It is perhaps surprising how much this point of view inspired the work of many 
sixteenth- to eighteenth-century mathematicians and scientists. For if this were indeed 
the case, then by understanding the mathematical laws of the universe, one could come 
closer to an understanding of the Creator himself. Believe it or not, this point of view 
survives to this day. The following is a quote from Paul Dirac, a Nobel Prize—winning 
physicist and a creator of modern quantum mechanics. 


It seems to be one of the fundamental features of nature that fundamental physical 
laws are described in terms of a mathematical theory of great beauty and power, 
needing quite a high standard of mathematics for one to understand it. You may 
wonder: Why is nature constructed along these lines? One can only answer that 
our present knowledge seems to show that nature is so constructed. We simply 
have to accept it. One could perhaps describe the situation by saying that God is 
a mathematician of a very high order, and He used very advanced mathematics in 
constructing the universe. Our feeble attempts at mathematics enable us to 
understand a bit of the universe, and as we proceed to develop higher and higher 
mathematics we can hope to understand the universe better. 


Mathematics began to see further advances and applications. In the sixteenth and 
seventeenth centuries, al-Khuwarizmi’s algebra was significantly advanced by Cardano, 
Vieta, and Descartes. The Babylonians had solved the quadratic equation, but now two 


figure 8 Duomo. 


figure 9 Nicolaus Copernicus 
(1473-1543). 


Historical Introduction xix 


thousand years later, del Ferro and Tartaglia solved the cubic equation, which in turn 
led to the discovery of imaginary numbers. These imaginary numbers were later to 
play a fundamental role, as we shall see, in the development of vector calculus. In the 
early seventeenth century, Descartes, perhaps motivated by the grid technique used by 
Italian fresco painters to locate points on a wall or canvas, created, in a moment of great 
mathematical inspiration, coordinate (or analytic) geometry. This new mathematical 
model enables one to reduce Euclid’s geometry to algebra and provides a precise and 
quantitative method to describe and calculate with space curves and surfaces. 

Early on, Archimedes’ great work in statics and equilibrium (centers of gravity, 
the principle of the lever—which we study in this book) was absorbed and improved 
upon, leading to dramatic engineering achievements. In a building spree that remains 
astonishing to this day, engineering advances made possible the rise of an incredible 
number of cathedrals throughout Europe, including the stunning Duomo in Florence, 
Notre Dame in Paris, and the Great Cathedral in Cologne, to mention a few. See Figure 8. 


However, as in Greek times, it was astronomy that was to give mathematics its greatest 
impetus. It is not surprising that the Greek astronomers placed the Earth and not the 
sun at the center of our universe, because on a daily basis we see the sun both rise and 
set. Still, it is interesting to ask if the Greeks, who were such marvelous thinkers, at 
least tested the heliocentric theory, which places the sun at the center of the universe. 
In fact, they did. In the third century B.c., Aristarchus of Samus taught that the Earth 
and other planets move in circular orbits around a fixed sun. His hypotheses were, for 
several reasons, rejected. First, the opposing astronomers reasoned that if the Earth were 
indeed moving, one should be able to sense it. Second, how would objects, circulating 
with us, be able to stay on a moving Earth? Third, why are the clouds not lagging behind 
the moving Earth? 

Such arguments were to be used again in the sixteenth century against the Polish 
astronomer Nicolas Copernicus (see Figure 9), who in 1543 introduced the heliocentric 
theory (the planets move in orbit around the sun). His book Revolutionibus Orbium 
Coelestium (On the Revolution of the Heavenly Orbits) was to initiate the “Copernican 
revolution” in science and to give the world a new word, revolutionary. 
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figure 11 The motion of Mars. 
From Kepler’s Astronomia Nova 
(1609). 


figure 12 Kepler’s second law. 


figure 10 Johannes Kepler 
(1571-1630). 


In 1619, the German astronomer Johannes Kepler (see Figure 10), using the astro- 
nomical calculations of the Danish astronomer Tycho Brahe, showed that the planetary 
orbits were in fact elliptical, the same ellipses that the Greeks had studied as abstract 
forms some 2000 years earlier (see Figure 11). 

But Kepler’s law of elliptical orbits was only one of three laws he discovered govern- 
ing planetary motion. Kepler’s second law states that if a planet moves from a point 
A to another point B in a certain amount of time T , and also moves from A’ to 
B’ in the same time, and if S is a focus of the orbital ellipse, then the sections SAB 
are SA’B’ have equal areas (see Figure 12). Kepler’s third law was that the square of 
time T a planetary body requires to complete an orbit is proportional to a*, where a 
is the great axis of the elliptical orbit. In equation form, T? = Ka?, where K is some 
constant (we shall derive this law for circular orbits in Chapter 4). 

Profound as these observations were, an explanation of why these laws held was 
lacking. However, by the middle of the seventeenth century, it was fully understood 
that a change of velocity requires the action of forces, but how these forces influenced 
motion was not at all clear. In 1674 Robert Hooke, in an attempt to explain Kepler’s 
laws, assumed the existence of an attractive force the sun must exert on the plan- 
ets, a force that decreased with planetary distance. Hooke’s theory, however, was only 
qualitative. 


Newton 


What was also seriously lacking was a quantitative, precise definition of both velocity 
and acceleration. This was ultimately solved by the invention of calculus by both Isaac 
Newton and Gottfried Wilhelm Leibniz (see Figure 13). Hooke was never able to achieve 
an understanding of the profound ideas behind the infinitesimal calculus. However, 
during the period of 1679—1680 Hooke discussed his ideas with Newton, including his 
conjecture that the force the sun exerts on the planets was actually inversely proportional 
to the square of the planetary distance. 

After Sir Christopher Wren, amateur astronomer, architect of the city of London 
and London’s magnificent St. Paul’s Cathedral, issued a public challenge to “theoretically 
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figure 13 Gottfried Wilhelm Leibniz (1646-1716). 


figure 14 Isaac Newton, one of the greatest 
scientific and mathematical intellects of all time, 
created the notion of a vector through his 
conception of forces as being vectorial. Here 
you see him depicted in 1725 (with his natural 
hair) just two years before his death, leafing 
through the pages of his masterpiece, the 
Philosopiae Naturalis Principia Mathematica, 
arguably the most influential and profound 
scientific work ever written and the true starting 
point of vector calculus. (Source: Courtesy of 
the National Portrait Gallery, London) 


determine” the orbits of the planets, Isaac Newton took a serious interest in the problem. 
Perhaps acting on rumors, the great British astronomer Edmund Halley (1656-1743) 
in August 1684 visited Newton in Cambridge and asked him directly what the orbit 
of a planet would be under an inverse square force. Newton answered that it had to 
be an ellipse. As the stunned Halley asked him how he knew this, Newton’s famous 
reply was “Why I have calculated it.’ Halley ultimately urged Newton to publish his 
results as a book, and these appeared in 1686 in Newton’s now legendary Principia. 
See Figure 15. 

This book, often and justly referred to as the foundation of modern science, had 
an immediate dramatic impact. Alexander Pope wrote: 


Nature and nature’s laws lay hid at night, 
God said, “Let Newton be” and all was light. 


In Figure 14, we see Newton holding open a copy of his Principia. 
Although Newton did not use calculus in the Principia, convincing arguments 
have been put forward that Newton originally used his calculus to derive the trajectories 
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figure 15 The frontispiece of the 
two-lines print of the Principia, 
carrying the imprint “Prostat 
apud plures Bibliopolas,” which is 
sometimes called the “first issue” 
of the first edition. The “export 
copy” (with the three lines 
“Prostant Venales apud Sam 
Smith ...aliosq; nonnullos 
Bibliopolas”) is called the second 
issue of the first edition. This 
distinction between the first and 
second issues seems to be quit 
unfounded. It has been 
suggested that Halley made an 
agreement with Smith 
concerning foreign sales; in fact, 
most of Smith's fifty copies were 
apparently sold on the 
continent. 
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of the planetary orbits from the inverse square law.* The Principia provided profound 
evidence that the universe, as the early Greeks had understood, was indeed designed 
mathematically. Incidentally, it was Newton who first conceptualized force as a vector, 
although he provided no formal definition of what a vector was. Such a formal definition 
had to wait for William Rowan Hamilton, a century and a half after the Principia. 

The invention of the calculus and the subsequent development of vector calculus was 
the true beginning of modern science and technology, which has changed our world so 
dramatically. From the mathematics of Newton’s mechanics to the profound intellectual 
constructs of Maxwell’s electrodynamics, Einstein’s relativity, and Heisenberg’s and 
Schrédinger’s quantum mechanics, we have seen the discoveries of radio, television, 
wireless communications, flight, computers, space travel, and countless engineering 
marvels. 

Underlying all these developments was mathematics, an exciting adventure of the 
mind and a celebration of the human spirit. It is in this context that we begin our account 
of vector calculus. 


*We shall study the problem of planetary orbits in Section 4.1 and further in the Internet supplement. 
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figure P.1 The geometric 
representation of points on the 
real-number line. 


figure P.2 The geometric 
representation of the intervals 
(a Ð. (c d) and (e f). 


We assume that students have studied the calculus of functions of a real variable, in- 
cluding analytic geometry in the plane. Some students may have had some exposure to 
matrices as well, although what we shall need is given in Sections 1.3 and 1.5. 

We also assume that students are familiar with functions of elementary calculus, 
such as sin x, cosx, e*, and logx (we write logx or lnx for the natural logarithm, 
which is sometimes denoted log, x). Students are expected to know, or to review as the 
course proceeds, the basic rules of differentiation and integration for functions of one 
variable, such as the chain rule, the quotient rule, integration by parts, and so forth. 

We now summarize the notations to be used later. Students can read through these 
quickly now, then refer to them later if the need arises. 

The collection of all real numbers is denoted R. Thus R includes the integers, ..., 
—3, —2, —1, 0, 1, 2, 3, ...; the rational numbers, p/q, where p and q are integers 
(q # 0); and the irrational numbers, such as J2, x, and e. Members of R may be 
visualized as points on the real-number line, as shown in Figure P.1. 


1 J2 2 e 3 


When we write a € R we mean that a is a member of the set R, in other words, that 
a is areal number. Given two real numbers a and b witha < b (that is, with a less than 
b), we can form the closed interval [a, b], consisting of all x such that a < x < b, and 
the open interval (a, b), consisting of all x such that a < x < b. Similarly, we can form 
half-open intervals (a, b] and [a, b) (Figure P.2). 


a b c d e f 
a eo 


Closed Open Half open 


The absolute value of a number a € R is written |a| and is defined as 


a ifa>0 
lal={ 


a ifa<0O. 


For example, |3| = 3, |—3] = 3, |O| = 0, and |—6| = 6. The inequality |a + b| < 
|a| + |b| always holds. The distance from a to b is given by |a — b|. Thus, the distance 
from 6 to 10 is 4 and from —6 to 3 is 9. 

If we write A C R, we mean A is a subset of R. For example, A could equal the set 
of integers {..., —3, —2, —1, 0, 1, 2, 3, ...}. Another example of a subset of R is the 
set Q of rational numbers. Generally, for two collections of objects (that is, sets) A and 
B, A C B means A is a subset of B; that is, every member of A is also a member of B. 

The symbol A U B means the union of A and B, the collection whose members are 
members of either A or B (or both). Thus, 


[paama =2, =f, 0} UHL 031, 2c .4b SS few = 3) = 25. = 1,,.0).1, 25. chs 
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Similarly, A O B means the intersection of A and B; that is, this set consists of those 
members of A and B that are in both A and B. Thus, the intersection of the two sets on 
the preceding page is {—1, 0}. 

We shall write A\B for those members of A that are not in B. Thus, 


hac Oa Oa Ol, had = i, 


We can also specify sets as in the following examples: 


{a € R | ais an integer} = {..., —3, —2, —1, 0, 1, 2,...} 
{a € R | ais an even integer} = {..., —2, 0, 2, 4, ...} 
{x ER|a<x<b}=[a, bd]. 


A function f: A — B is arule that assigns to eacha € A one specific member f(a) of 
B. We call A the domain of f and B the target of f . The set { f(x) | x € A} consisting 
of all the values of f(x) is called the range of f. Denoted by f(A), the range is a subset 
of the target B. It may be all of B, in which case f is said to be onto B. The fact that 
the function f sends a to f(a) is denoted by a +» f(a). For example, the function 
f(x) = x?/(1 — x) that assigns the number x*/(1 — x) to each x 4 1 in R can also 
be defined by the rule x + x7/(1 — x). Functions are also called mappings, maps, or 
transformations. The notation f: A C R —> R means that A is a subset of R and that 
f assigns a value f(x) in R to each x € A. The graph of f consists of all the points 
(x, f(x)) in the plane (Figure P.3). 


Graph of f 


(x, s J 


figure P.3 The graph of a function with 
the half-open interval Aas domain. 


A = domain 


The notation }>7_, a; means a, +- +--+ an, where a), ..., a, are given numbers. The 
sum of the first n integers is 


n 1 
1424n = Di s D, 
i=l 


The derivative of a function f(x) is denoted f'(x), or 


df 
dx 


> 
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and the definite integral is written 


a 


If we set y = f(x), the derivative is also denoted by 


dy 
dx 


Readers are assumed to be familiar with the chain rule, integration by parts, and other 
basic facts from the calculus of functions of one variable. In particular, they should know 
how to differentiate and integrate exponential, logarithmic, and trigonometric functions. 
Short tables of derivatives and integrals, which are adequate for the needs of this text, 
are printed at the front and back of the book. 

The following notations are used synonymously: e* = expx,Inx = logx, and 
sin” x = arcsin x. 

The end of a proof is denoted by the symbol W, while the end of an example or remark 
is denoted by the symbolA. 
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Tne Geometry of 
Euclidean Space 


Quaternions came from Hamilton ... and have been an unmixed evil to those who have 
touched them in any way. Vector is a useless survival ... and has never been of the slightest 


use to any creature. —Lord Kelvin 


In this chapter we consider the basic operations on vectors in two- and 
three-dimensional space: vector addition, scalar multiplication, and the 
dot and cross products. In Section 1.5 we generalize some of these no- 
tions to nspace and review properties of matrices that will be needed 


in Chapters 2 and 3. 


1.1 Vectors in Two- and Three-Dimensional Space 


az 


y 
A 
Peg nee 
J i 
J} 
J | 
f l 
>y l > x 
KW 
ay 


figure 1.1.1 Cartesian 
coordinates in the plane. 


Points P in the plane are represented by ordered pairs of real numbers (ai, a2); the 
numbers a; and az are called the Cartesian coordinates of P. We draw two perpendicular 
lines, label them as the x and y axes, and then drop perpendiculars from P to these axes, 
as in Figure 1.1.1. After designating the intersection of the x and y axes as the origin 
and choosing units on these axes, we produce two signed distances a and az as shown 
in the figure; a; is called the x component of P, and az is called the y component. 

Points in space may be similarly represented as ordered triples of real numbers. To 
construct such a representation, we choose three mutually perpendicular lines that meet 
at a point in space. These lines are called x axis, y axis, and z axis, and the point at 
which they meet is called the origin (this is our reference point). We choose a scale on 
these axes, as shown in Figure 1.1.2. 

The triple (0, 0, 0) corresponds to the origin of the coordinate system, and the arrows 
on the axes indicate the positive directions. For example, the triple (2, 4, 4) represents 
a point 2 units from the origin in the positive direction along the x axis, 4 units in the 
positive direction along the y axis, and 4 units in the positive direction along the z axis 
(Figure 1.1.3). 

Because we can associate points in space with ordered triples in this way, we often 
use the expression “the point (a1, a2, a3)” instead of the longer phrase “the point P 
that corresponds to the triple (a1, a2, a3).” We say that a; is the x coordinate (or first 
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figure 1.1.2 Cartesian coordinates in space. 


figure 1.1.3 Geometric representation of the point 
(2, 4, 4) in Cartesian coordinates. 


coordinate), a2 is the y coordinate (or second coordinate), and a3 is the z coordinate 
(or third coordinate) of P. Itis also common to denote points in space with the letters 
x, y, and z in place of ay, az, and a3. Thus, the triple (x, y, z) represents a point whose 
first coordinate is x, second coordinate is y, and third coordinate is z. 

We employ the following notation for the line, the plane, and three-dimensional 
space: 


(i) The real number line is denoted R! or simply R. 

(ii) The set of all ordered pairs (x, y) of real numbers is denoted R?. 

(iii) The set of all ordered triples (x, y, z) of real numbers is denoted IR?. 

When speaking of R}, R?, and R? simultaneously, we write R”, wheren = 1, 2, or 3; or 
R”, wherem = 1, 2, 3. Starting in Section 1.5 wewill also study R” forn = 4,5, 6,..., 


but the cases n = 1, 2, 3 are closest to our geometric intuition and will be stressed 
throughout the book. 


Vector Addition and Scalar Multiplication 


The operation of addition can be extended from R to R? and R3. For R3, this is done as 
follows. Given the two triples (a1, a2, a3) and (bı, b2, b3), we define their sum to be 


(a1, a, a3) + (bı, b2, b3) = (a1 + bı, az + b2, a3 + b3). 
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example | (1,1, 1) + (2, —3, 4) = (3, —2, 5), 
(x, y, z) + (0, 0,0) = (x, y, z), 
(1,7,3)+(a,b,c)=(lta,7+b,34+c) A 


The element (0, 0, 0) is called the zero element (or just zero) of R?. The element 
(—a1, —a2, —a3) is the additive inverse (or negative) of (a1, a2, a3), and we will write 
(a1, az, a3) — (bi, bz, b3) for (ay, an, a3) + (—b1, —b2, —b3). 

The additive inverse, when added to the vector itself, of course produces zero: 


(a1, a2, a3) + (—a1, —az, —a3) = (0, 0, 0). 


There are several important product operations that we will define on R?. One of 
these, called the inner product, assigns a real number to each pair of elements of R?. 
We shall discuss it in detail in Section 1.2. Another product operation for IR? is called 
scalar multiplication (the word “scalar” is asynonym for “real number”). This product 
combines scalars (real numbers) and elements of IR? (ordered triples) to yield elements of 
R? as follows: Given a scalar œ and atriple (a1, a2, a3), we define the scalar multiple by 


alay, a2, a3) = (aay, aa2, qaz). 


example 2 2(4,e, 1) = (2-4,2 -e,2 - 1) = (8, 2e, 2), 
6(1,1, 1) = (6, 6, 6), 
l(u,v,w) = (u, v, w), 
O(p,g,r) =(0,0,0) A 


Addition and scalar multiplication of triples satisfy the following properties: 


associativity) 
distributivity) 
distributivity) 


(wB)(a1, a2, a3) = a[ B(a1, a2, a3)] 
(ii 


(a + B)(a1, az, a3) = ala, az, a3) + Bai, a2, a3) 


i) 
) 
iii) œ[(a1, a2, a3) + (by, bz, b3)] = aay, az, a3) + a(b, bz, b3) 
) 
) 
) 


ee 


(iv) o(0, 0, 0) = (0, 0, 0) property of zero) 
(v) Olai, az, a3) = (0, 0, 0) property of zero) 
(vi) (ay, a2, a3) = (ay, a, a3) property of the 


unit element) 


The identities are proven directly from the definitions of addition and scalar multi- 
plication. For instance, 


= ((~ + B)ai, (œ + B)a2, (œ + B)a3) 
= (aa, + Bay, waz + Baz, wa3 + Ba3) 
— (ay, az, a3) + plan, a2, a3). 


(æ + B)(a1, a2, a3) 


For R?, addition and scalar multiplication are defined just as in R3, with the third 
component of each vector dropped off. All the properties (i) to (vi) still hold. 
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example 3 


solution 


Interpret the chemical equation 2NH2 + H2 = 2NH3 as a relation in the algebra of 
ordered pairs. 


Wethink of themoleculeN „H , (x atoms of nitrogen, y atoms of hydrogen) as represented 
by the ordered pair (x, y). Then the chemical equation given is equivalent to 2(1, 2) + 
(0, 2) = 2(1, 3). Indeed, both sides are equal to (2,6). A 


Geometry of Vector Operations 


Let us turn to the geometry of these operations in R? and R?. For the moment, we define 
a vector to be a directed line segment beginning at the origin; thatis, a line segment with 
specified magnitude and direction, and initial point at the origin. Figure 1.1.4 shows 
several vectors, drawn as arrows beginning at the origin. In print, vectors are usually 
denoted by boldface letters such as a By hand, we usually write them as a or simply as 
a, possibly with a line or wavy line under it. 

Using this definition of a vector, we associate with each vector athe point (a1, a2, 
a3) where a terminates, and conversely, we can associate a vector a with each point 
(a1, a2, a3) in space. Thus, we shall identify awith (a1, a2, a3) and writea= (aj, az, a3). 
For this reason, the elements of R? not only are ordered triples of real numbers, but are 
also regarded as vectors. The triple (0, 0, 0) is denoted @ We call aj, a2, and a3 the 
components of a or when we think of aas a point, its coordinates. 

Two vectors a = (a1, a2, a3) and b = (bı, b2, b3) are equal if and only if a; = 
bı, a = bz, and a3 = b3. Geometrically this means that aand bhave the same direction 
and the same length (or “magnitude”). 

Geometrically, we define vector addition as follows. In the plane containing the 
vectors a= (ay, az, a3) and b= (hj, b2, b3) (See Figure 1.1.5), form the parallelogram 


figure 1.1.4 Geometrically, vectors are thought 
of as arrows emanating from the origin. 


figure 1.1.5 The geometry of vector addition. 
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figure 1.1.6 A physical interpretation of vector addition. 


having aas one side and bas its adjacent side. The sum a+ bis the directed line segment 
along the diagonal of the parallelogram. 

This geometric view of vector addition is useful in many physical situations, as we 
Shall see in the next section. For an easily visualized example, consider a bird or an 
airplane flying through the air with velocity vı, but in the presence of a wind with 
velocity w. The resultant velocity, vı + Wp, is what one sees; see Figure 1.1.6. 

To show that our geometric definition of addition is consistent with our algebraic 
definition, we demonstrate that a+ b= (a; + bı, az + b2, a3 + b3). We shall prove this 
result in the plane and leave the proof in three-dimensional space to the reader. Thus, 
we wish to show that if a= (a1, a2) and b= (bı, b2), then a+ b= (a; + bı, az + b2). 

InFigure 1.1.7 leta= (aj, a2) bethe vector ending atthe pointA , andlet b= (by, b2) 
be the vector ending at point B. By definition, the vector a+ bends at the vertex C of 
parallelogram OBCA. To verify that a+ b= (a; + bı, a2 + b2), it suffices to show that 
the coordinates of C are (a; + bı, a2 + b2). The sides of the triangles OAD and BCG 
are parallel, and the sides OA and BC have equal lengths, which we writeas OA = BC. 
These triangles are congruent, so BG = OD; since BGFE is a rectangle, EF = BG. 
Furthermore, OD = a; and OE = bı. Hence, EF = BG = OD = a1. Since OF = EF + 
OE, it follows that OF = a; + bı. This shows that the x coordinate of a+ bis a; + bı. 
The proof that the y coordinate is az + bz is analogous. This argument assumes A and 
B to be in the first quadrant, but similar arguments hold for the other quadrants. 

Figure 1.1.8(a) illustrates another way of looking at vector addition: in terms of 
triangles rather than parallelograms. That is, we translate (without rotation) the directed 
line segment representing the vector bso that it begins at the end of the vector a The 
endpoint of the resulting directed segment is the endpoint of the vector a+ b. We 
note that when a and bare collinear, the triangle collapses to a line segment, as in 
Figure 1.1.8(b). 


> 


A = (a, a) 


figure 1.1.7 The construction used to prove 
that (a7, a2) + (b1, b2) = (ai + bi, a2 + b2). 
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figure 1.1.8 (a) Vector addition 
may be visualized in terms of 
triangles as well 

as parallelograms. (b) The 
triangle collapses to a line 
segment when a and b are 
collinear. 


ptransiated 


a+b 
btransl ated 


(a) 


> xX 


(b) 


In Figure 1.1.8 we have placed aand b head to tail. That is, the tail of bis placed 
at the head of a and the vector a+ bgoes from the tail of ato the head of b. If we do 
itin the other order, b+ a we get the same vector by going around the parallelogram 
the other way. Consistent with this figure, it is useful to let vectors “glide” or “slide,” 
keeping the same magnitude and direction. We want, in fact, to regard two vectors as 
the same if they have the same magnitude and direction. When we insist on vectors 
beginning at the origin, we will say that we have bound vectors. If we allow vectors to 
begin at other points, we will speak of free vectors or just vectors. 


vectors Vectors (also called free vectors) are directed line segments in [the plane 
or] space represented by directed line segments with a beginning (tail) and an end 
(head). Directed line segments obtained from each other by parallel translation 
(but not rotation) represent the same vector. 

The components (a1, a2, a3) of a are the (signed) lengths of the projections 
of aalong the three coordinate axes; equivalently, they are defined by placing 
the tail of aat the origin and letting the head be the point (aj, az, a3). We write 


a= (a1, a2, a3). 


Two vectors are added by placing them head to tail and drawing the vectors 
from the tail of the first to the head of the second, as in Figure 1.1.8. 


Scalar multiplication of vectors also has a geometric interpretation. If œ is a scalar 
and aa vector, we define «ato be the vector that is |a| times as long as a, with the same 
direction as aif œ > 0, but with the opposite direction if œ < 0. Figure 1.1.9 illustrates 
several examples. 


> xX 


> 


figure 1.1.9 Some scalar multiples of a vector a. 


"a 


> X > X 


example 4 


solution 
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figure 1.1.10 The geometry of vector subtraction. 


Using an argument based on similar triangles, we find that if a= (a1, az, a3), and a 
is a scalar, then 


aa= (adj, aa2, aa3). 


That is, the geometric definition coincides with the algebraic one. 

Given two vectors aand b, how do we represent the vector b— ageometrically, that 
is, whatis the geometry of vector subtraction? B ecausea+(b—a) = b, we see that b-a 
is the vector that we add to ato get b In view of this, we may conclude that b— ais the 
vector parallel to, and with the same magnitude as, the directed line segment beginning 
at the endpoint of aand terminating at the endpoint of bwhen aand bbegin at the same 
point (see Figure 1.1.10). 


Let uand vbe the vectors shown in Figure 1.1.11. Draw the two vectors u+ vand —2u. 
W hat are their components? 


figure 1.1.11 Find u + v and —2u. 


Place the tail of wat the tip of uto obtain the vector shown in Figure 1.1.12. 

The vector —2u, also shown, has length twice that of uand points in the opposite 
direction. From the figure, we see that the vector u+ v has components (5, 2) and —2u 
has components (—6, —4). 
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figure 1.1.12 Computing u + v and —2u. A 


example 5 | (a) Sketch —2v, where v has components (—1, 1, 2). 


(b) If wand ware any two vectors, show that v — ;wand 3v— ware parallel. 


solution (a) The vector —2v is twice as long as v, but points in the opposite direction (see 
Figure 1.1.13). 


>N 


(-1,1, 2) 


© (2,-2,-4) 


figure 1.1.13 Multiplying (-1, 1, 2) by —2. 


(b) v— s=w= 3(3V— w); vectors that are multiples of one another are parallel. A 


1 
3 
The Standard Basis Vectors 

To describe vectors in space, it is convenient to introduce three special vectors along 
the x, y, and z axes: 

i: the vector with components (1, 0, 0) 

j: the vector with components (0, 1, 0) 


k: the vector with components (0, 0, 1). 


example 6 


solution 


example 7 
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figure 1.1.14 The standard basis vectors. 


These standard basis vectors are illustrated in Figure 1.1.14. In the plane we have the 
standard basis i and j with components (1, 0) and (0, 1). 
Let abe any vector, and let (a1, a2, a3) be its components. Then 


a= qi+ aj + a3K, 
because the right-hand side is given in components by 


aı(1, 0, 0) + a2(0, 1, 0) + a3(0, 0, 1) = (ar, 0, 0) + (0, a2, 0) + (0, 0, as) 


= (a, a2, a3). 


Thus, we can express every vector as a sum of scalar multiples of i, j, and K. 


The Standard Basis Vectors 


1. The vectors i, j, and Kare unit vectors along the three coordinate axes, as 
shown in Figure 1.1.14. 


2. If ahas components (a1, a2, a3), then 


a= ai ar aj ar a3k. 


Express the vector whose components are (e, x, —/3) in the standard basis. 
Substituting aı = e, a2 = x, and a3 = — v3 into a= ai+ aj + a3kgives 


v=eit+azj—v3k 4 


Thevector (2, 3, 2) equals 2i+-3j+2k, andthe vector (0, —1, 4) is—j+ 4k Figure 1.1.15 
shows 2i+ 3j + 2k you draw inthevector-j+4k A 


Addition and scalar multiplication may be written in terms of the standard basis 
vectors as follows: 


(aiñ + aj + azk) + (bii + bj + b3k) = (a1 + bı)i + (a2 + b2)j + (a3 + b3)k 


and 


alai + az) + a3k) = (aa1)i+ (wa)j + a(a3)k 
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figure 1.1.16 The 

vector from P to 
—, 

P’ is denoted PP’. 


0| 


figure 1.1.17 
PB =OP - OÈ. 


example 8 


solution 


example 9 


figure 1.1.15 Representation of (2, 3, 2) 
in terms of the standard basis vectors i, j, 
and k. 


x 


The Vector Joining Two Points 


To apply vectors to geometric problems, itis useful to assign a vector to a pair of points 
in the plane or in space, as follows. Given two points P and P’, we can draw the vector 
vwith tail P and head P’, as in Figure 1.1.16, where we write PP’ for v. 

If P = (x, y,z) and P’ = (x’, y’, z’), then the vectors from the origin to P and P’ 
are a = xi + yj + zkanda = x'i + yj + z’k, respectively, so the vector PP’ is the 
difference a — a= (x’ — x)i+(y’ — y)j + (z' — z)k (See Figure 1.1.17.) 


The Vector Joining Two Points If the point P has coordinates (x, y, z) and 
P’ has coordinates (x’, y’, z’), then the vector PP’ from the tip of P to the tip of P’ 
has components (x’ — x, y’ — y, z’ — z). 


(a) Find the components of the vector from (3, 5) to (4, 7). 
(b) Add the vector vfrom (—1, 0) to (2, —3) and the vector wfrom (2, 0) to (1, 1). 


(c) Multiply the vector vin (b) by 8. If the resulting vector is represented by the 
directed line segment from (5, 6) to Q, what is Q? 


(a) Asin the preceding box, we subtract the ordered pairs: (4, 7) — (3, 5) = (1, 2). 
Thus the required components are (1, 2). 


(b) The vector whas components (2, —3) — (—1, 0) = (3, —3), and whas 
components (1, 1) — (2, 0) = (—1, 1). Therefore, the vector v+ whas 
components (3, —3) + (—1, 1) = (2, —2). 


(c) The vector 8vhas components 8(3, —3) = (24, —24). If this vector is represented 
by the directed line segment from (5, 6) to Q, and Q has coordinates (x, y), then 
(x, y) — (5, 6) = (24, —24), so (x, y) = (5, 6) + (24, —24) = (29, —18). A 


Let P = (—2, —1), Q = (—3, —3), and R = (—1, —4) in the xy plane. 

(a) Draw these vectors: vjoining P to Q; wjoining Q to R; ujoining R to P. 
(b) What are the components of v, w, and u? 

(c) Whatis v+ w+ u? 
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solution (a) See Figure 1.1.18. 


3 figure 1.1.18 The vector v joins P to Q; w joins Q to 
M> R; and ujoins R to P. 


v= (—3, —3) —(-2 = ( 
w= (1, —4) — (-3, —3) = (2 
u= 


(c) v+w+ u= (—1, —2) + (2, —1) + (—1, 3) = (0,0). A 


Geometry Theorems by Vector Methods 


M any of the theorems of plane geometry can be proved by vector methods. H ere is one 
example. 


example 10 | Use vectors to prove that the diagonals of a parallelogram bisect each other. 


solution Let OPRQ be the parallelogram, with two adjacent sides represented by the vectors 
a = OP and b = 00. Let M be the midpoint of the diagonal OR, and let N be the 
midpoint of the other diagonal, PQ. (See Figure 1.1.19.) 


figure 1.1.19 If the midpoints M and N coincide, then 
the diagonals OR and PQ bisect each other. 


Observe that OR = OP + 06 = = a+ bby the parallelogram rule for vector addition, 
so OM = 10R = 5(a+ b). On the other hand, 


D-0 -0P -b-a so PN =3PQ = }(b- a, 
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figure 1.1.20 The line /, 
parametrically given by 

I) =a + tv, lies in the direction v 
and passes through the tip of a. 


example 11] 


and hence, 
ON =OP +PN = a+ }(b—a = (a+b). 


Because OM and ON are equal vectors, the points M and N coincide, so the diagonals 
bisect each other. A 


Equations of Lines 


Planes and lines are geometric objects that can be represented by equations. We shall 
defer until Section 1.3 a study of equations representing planes. However, using the 
geometric interpretation of vector addition and scalar multiplication, we will now find 
the equation ofa line | that passes through the endpoint of the vector a, with the direction 
of a vector (see Figure 1.1.20); that is, line Z is parallel to the vector v. 


As t varies through all real values, the points of the form rware all scalar multiples 
of the vector v, and therefore exhaust the points of the line passing through the origin 
in the direction of v. Because every point on / is the endpoint of the diagonal of a 
parallelogram with sides aand rvfor some real value of r, we see that all the points on / 
are of the form a+ rv. Thus, the line Z may be expressed by the equation I(r) = a+ tv. 
We say that / is expressed parametrically, with ¢ the parameter. Att = 0, (rt) =a As 
t increases, the point I(t) moves away from ain the direction of v. As + decreases from 
t = 0 through negative values, I(r) moves away from ain the direction of —v. 


Point-Direction Form of a Line 


ee an 
y=yi+ dt, 
B= Bae Gh, 


where a= (x1, yi, zı) and v = (a, b, c). For lines in the xy plane, we simply drop 
the z component. 


Determine the equation of the line Z passing through (1, 0, 0) in the direction j. See 
Figure 1.1.21. 


solution 


example 12 


solution 


example 13 


solution 
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> N 


(1,0, 0) (7) 


x 


figure 1.1.21 The line / passes through the tip of i in the direction j. 


The desired line can be expressed parametrically as I+) = i+rj.1n terms of coordinates, 


Kz) = (1, 0,0) +2(0, 1,0) =(1,7,0). 4 


(a) Find the equations of the line in space through the point (3, —1, 2) in the direction 
2i — 3j + 4k. 


(b) Find the equation of the line in the plane through the point (1, —6) in the direction 
of 5i — xj. 


(c) In what direction does the line x = —3r+ 2, y = —2(t — 1), z = 8t + 2 point? 


(a) Here a= (3, —1, 2) = (x1, yı, z1) and v = 2i — 3j + 4k, so a = 2, b = —3, and 
c = 4. From the preceding box, the equations are 


x=3+2t, y=-l- 3, z=2+ 4t. 
(b) Here a= (1, —6) and v= 5i — xj, so the required line is 
Kr) = (1, —6) + (5t, —xt) = (1 + 5t, —6 — xt); 
that is, 
x=1+5, y=-6—a2t. 
(c) Using the box, we construct the direction v= aì + bj + ckfrom the 


coefficients of t: a = —3, b = —2, c = 8. Thus, the line points in the direction 
of v=-—3i-2j+8k A 


Do thetwo lines (x, y, z) = (t, —6t +1, 2t —8) and (x, y, z) = (3t +1, 2t, 0) intersect? 


If the lines intersect, there must be numbers rf; and tz such that the corresponding points 
are equal: 


(th, —ô6tı +1, 2t = 8) = (3t +1, 2h, 0); 
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that is, all three of the following equations hold: 


ti = 3h +1, 
—64 +1= 2h, 
2t —8=0. 


From the third equation, t; = 4. The first equation then becomes 4 = 34 + 1; that is, 
tı = 1. We must check whether these values satisfy the middle equation: 


-6n +1 = 2h, 
Since t = 4 and n = 1, this reads 

~244+142, 
which is false, so the lines do not intersect. A 


Notice that there can be many equations of the same line. Some may be obtained by 
choosing, instead of a a different point on the given line, and forming the parametric 
equation of the line beginning at that point and in the direction of v. For example, the 
endpoint of a+ vis on the line I(t) = a+ rv, and thus, h (z) = (a+ v) + tv represents 
the same line. Still other equations may be obtained by observing that if a 4 0, the 
vector av has the same (or opposite) direction as v. Thus, h(t) = a+ tævis another 
equation of I(r) = a+ rv. 

For example, both Kr) = (1,0, 0) + (z,t,0) and h(s) = (0, —1, 0) + (s, s, 0) 
represent the same line since both are in the direction i + j and both pass through the 
point (1, 0, 0); I passes through (1, 0, 0) att = 0 and |, passes through (1, 0, 0) ats = 1. 

Therefore, the equation of a line is not uniquely determined. Nevertheless, it is 
customary to use the term “the equation of a line.” Keeping this in mind, let us derive 
the equation of a line passing through the endpoints of two given vectors a and b. 
Because the vector b— ais parallel to the directed line segment from ato b, we 
calculate the parametric equation of the line passing through ain the direction of b— a 
(Figure 1.1.22). Thus, 


I(t) =a+r¢(b-—a); that is, I(t) = (1 — t)a+ tb. 


Ast increases from 0 to 1, r(b— a) starts as the zero vector and increases in length 
(remaining in the direction of b— a) until at ż = 1 it is the vector b— a Thus, for 
Kt) = a++t(b— aj, as t increases from 0 to 1, the vector I(t) moves from the endpoint 
of ato the endpoint of balong the directed line segment from ato b. 


l 


figure 1.1.22 The line /, parametrically 
given by I() = a + Kb — a) = 

(1 — Na + tb, passes through the tips 
of a and b. 
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If P = (x1, yz, zı) is the tip of aand Q = (x2, y2, z2) is the tip of b, then v = 
(x2 — x1)i+ ( y2 — yi)j + (z2 — 21), and so the equations of the line are 


x = x1 + (x2 — xı)t, 
y=yt(y-yit, 
z =z + (z2 — zı)t. 


By eliminating z, these can be written as 


x=x  Y-yl 2721 


X2 — X1 y2 — yı 42 = z1 


Parametric Equation of a Line: Point-Point Form The parametric 
equations of the line / through the points P = (x1, yı, zı) and Q = (x2, y2, z2) are 


x = x1 + (x2 — x1)t, 
y= y a= 


z = z1 + (z2 — zı1)t, 


where (x, y, z) isthe general point of 7, and the parameter ¢ takes on all real values. 


example 14 | Find the equation of the line through (2, 1, —3) and (6, —1, —5). 


solution Using the preceding box, we choose (x, y1, zı) = (2,1, —3) and (x2, y2, z2) = 
(6, —1, —5), so the equations are 


x=2+(6—2)t=24+ 4, 
y=14+(-1-1)r=1-22, 
z = —3 + (—5 — (—3)) = -3 - 2t. A 


example 15 | Find the equation of the line passing through (—1, 1, 0) and (0, 0, 1) (see Figure 1.1.23). 


X 


figure 1.1.23 Finding the equation of the line through two points. 
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solution 


example 16 


solution 


example 17 


solution 


Letting a= —i+j and b= krepresent the given points, we have 
Kt) = (1 —2)(-i+j) +r+k = —(1 -ż)i+ (1 -t)j+tk. 
The equation of this line may thus be written as 
Kt) =(¢-—1i+ (1-4) j+c¢k 
or, equivalently, if +) = xi+ yj + zk, 
x=t-l, y=1-t, z=t. A 


The description of a line segment requires that the domain of the parameter + be 
restricted, as in the following example. 


Find the equation of the line segment between (1, 1, 1) and (2, 1, 2). 


The dine through (1, 1, 1) and (2, 1, 2) is described in parametric form by (x, y, z) = 
(1+ +,1,1-+ 14), as ¢ takes on all real values. When ¢ = 0, the point (x, y, z) is 
(1, 1, 1), and when ¢ = 1, the point (x, y, z) is (2, 1, 2). Thus, the point (x, y, z) lies 
between (1, 1, 1) and (2, 1, 2) when 0 < ¢ < 1, so the line segment is described by the 
equations 


x=1+t, 
y=1, 
z=1l+t, 


together with theinequalitiesO<t<1. A 
We can also give parametric descriptions of geometric objects other than lines. 


Describe the points that lie within the parallelogram whose adjacent sides are the vectors 
aand bbased at the origin (“within” includes points on the edges of the parallelogram). 


Consider Figure 1.1.24. If P is any point within the given parallelogram and we construct 
lines 2; and 7/2 through P parallel to the vectors a and b, respectively, we see that /; 
intersects the side of the parallelogram determined by the vector b at some point rb, 
where 0 < ż < 1. Likewise, /2 intersects the side determined by the vector a at some 
point sa, where0 <s <1. 


h P 
a figure 1.1.24 Describing points within the 
3 h parallelogram formed by vectors a and b, with 
vertex 0. 
(0) — L2 
b 


NotethatP isthe endpoint of the diagonal of a parallelogram having adjacent sides sa 
and rb; hence, if v denotes the vector OP, we see that v = sa+ tb. We conclude that all 


figure 1.1.25 Describing points P 
in the plane formed from vectors 
vandw. 
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the points in the given parallelogram are endpoints of vectors of the form sa+ tb for 
0 <s <1land0 <tr <1. Reversing our steps, we see that all vectors of this form end 
within the parallelogram. A 


As two different lines through the origin determine a plane through the origin, so do 
two nonparalle! vectors. If we apply the same reasoning as in Example 17, we see that 
the entire plane formed by two nonparallel vectors vand weonsists of all points of the 
form sw-+ tw, where s and ¢ can be any real numbers, as in Figure 1.1.25. 

We have thus described the points in the plane by two parameters. For this reason, we 
say the plane is two-dimensional. Similarly, a line is called one-dimensional whether 
it lies in the plane or in space or is the real number line itself. 

The plane determined by vand wis called the plane spanned by vand w When vis 
a scalar multiple of wand w+ Q then vand ware parallel and the plane degenerates to 
a straight line. When v= w= O(i.e.,, both are zero vectors), we obtain a single point. 

There are three particular planes that arise naturally in a coordinate system and that 
will be useful to us later. We call the plane spanned by vectors i and j the xy plane, the 
plane spanned by j and kthe yz plane, and the plane spanned by i and kthe xz plane. 
These planes are illustrated in Figure 1.1.26. 


figure 1.1.26 The three coordinate 
planes. 


The Geometry of Euclidean Space 


exercises 


(Exercises with colored numbers are solved in the Study Guide.) 


Complete the computations in Exercises I to 4. 


L (-21, 23) — (?, 6) = (—25, ?) 


2 3(133, —0.33, 0) + (—399, 0.99, 0) = (?, ?, ?) 


3. (8a, —2b, 13c) = (52, 12, 11) + 4(2, ?,?) 


4 (2, 3,5) — 41+ 3j = (?, 2, ?) 


In Exercises 5 to 8, sketch the given vectors Wand W. On your sketch, draw in —V, V+ W, and V — W 


5. v= (2, 1) and w= (1, 2) 

6. v= (0, 4) and w= (2, —1) 

7. v= (2, 3, —6) and w= (—1, 1, 1) 
8. v= (2, 1, 3) and w= (2, 0, —1) 


9, Let v= 2i+j and w= i + 2j. Sketch 
v, wọ V+ w, 2w and v— win the plane. 


10. Sketch (1, —2, 3) and (—}, 3, —1). Why do these 
vectors point in opposite directions? 


11. What restrictions must be made on x, y, and z so that the 
triple (x, y, z) will represent a point on the y axis? On 
the z axis? In the xz plane? In the yz plane? 


12. (a) Generalize the geometric construction in Figure 
1.1.7 to show that if vı = (x, y, z) and = (x’, y’, z’), 
then v + W% = (x +x’, y +y,z +z’). 


(b) Using an argument based on similar triangles, prove 
that av = (ax, æy, az) when v= (x, y, z). 


In Exercises 13 to 19, use set theoretic or vector notation or both to describe the points that lie in the given configurations. 


13. The plane spanned by vı = (2, 7, 0) and v2 = (0, 2, 7) 


14. The plane spanned by vı = (3, —1, 1) and w = (0, 3, 4) 


15. The line passing through (—1, —1, —1) in the direction 
ofj 


16. The line passing through (0, 2, 1) in the direction of 
2i— k 


17. The line passing through (—1, —1, —1) and (1, —1, 2) 
18. The line passing through (—5, 0, 4) and (6, —3, 2) 


19. The parallelogram whose adjacent sides are the vectors 
i+ 3kand —2j 


20. Show that (+) = (1, 2, 3) + ¢(1, 0, —2) and 
b(t) = (2, 2, 1) + t(—2, 0, 4) parametrize the 
same line. 


2L Do the points (2, 3, —4), (2, 1, —1), and (2, 7, —10) lie 
on the same line? 


22. Let u= (1, 2), v= (—3, 4), and w= (5, 0): 


(a) Draw these vectors in R2. 


(b) Find scalars Az and Az such that w= A ,U+ A2dv. 


23. Suppose A, B, and C are vertices of a triangle. Find 
AB +BC+ CÀ. 


24. Find the points of intersection of the line 
x=342t,y=7+4 81,2 = —2 + t, thatis, I(t) = 
(3+ 2t, 7 + 8t, —2 + t), with the coordinate planes. 


25. Show that there are no points (x, y, z) satisfying 
2x — 3y + z — 2 = 0 and lying on the line 
v= (2, —2, —1) + ¢(1, 1, 1). 


26. Show that every point on the line 
v= (1, —1, 2) + ¢(2, 3, 1) satisfies the equation 
5x —3y—-z—6=0. 


27. Determine whether the lines 
x=3t+2,y=rt-—1,z=6t+1,and 
x = 3s — l, y = s — 2, z = s intersect. 


28. Do the lines (x, y, z) = (t + 4, 4t + 5, t — 2) and 
(x, y, z) = (2s + 3, s + 1, 2s — 3) intersect? 
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In Exercises 29 to 31, use vector methods to describe the given configurations. 


29. The parallelepiped with edges the vectors a, b, and c 
emanating from the origin 


30. The points within the parallelogram with one corner 
at (xo, yo, zo) whose sides extending from that 


Prove the statements in Exercises 32 to 34. 
32. The line segment joining the midpoints of two sides of a 
triangle is parallel to and has half the length of the third 
side. 


33. If PQR is atriangle in space and b > 0 is a number, then 


Problems 35 and 36 require some knowledge of chemical notation. 


35. Write the chemical equation CO +H20 = H2 + CO2 as 
an equation in ordered triples (x1, x2, x3), where 
X1, x2, x3 are the number of carbon, hydrogen, and 
oxygen atoms, respectively, in each molecule. 


36. (a) Write the chemical equation pC3H403 + gO2 = 
rCO2 + sH20 as an equation in ordered triples with 
unknown coefficients p, q, r, and s. 


corner are equal in magnitude and direction to vectors a 
and b 


3L The plane determined by the three points 


(xo, yo. zo), (x1, y1 z1), and (x2, y2, z2) 


there is a triangle with sides parallel to those of PQR and 
side lengths b times those of PQR. 


34. The medians of a triangle intersect at a point, and this 


point divides each median in a ratio of 2:1. 


(b) Find the smallest positive integer solution for 
p.q, r ands. 


(c) Illustrate the solution by a vector diagram in space. 


a Find a line that lies entirely in the set defined by the 


equation x? + y? — z2? = 1. 


1.2 The Inner Product, Length, and Distance 


In this section and the next we shall discuss two products of vectors: the inner product 
and the cross product. These are very useful in physical applications and have interesting 
geometric interpretations. The first product we shall consider is called the inner product. 
The name dot product is often used instead. 


The Inner Product 


Suppose we have two vectors aand bin R? (Figure 1.2.1) and we wish to determine 
the angle between them, that is, the smaller angle subtended by aand bin the plane 
that they span. The inner product enables us to do this. Let us first develop the concept 


figure 1.2.1 6 is the angle between the vectors 
a and b. 
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example | 


solution 


formally and then prove that this product does what we claim. Let a = aii + a2j + a3k 
and b = bii + b2j + b3k We define the inner product of aand b, written a- b, to be 
the real number 


a b= aıbı + azb2 + a3b3. 


Note that the inner product of two vectors is a scalar quantity. Sometimes the inner 
product is denoted (a, b); thus, (a b) and a- b mean exactly the same thing. 


(a) If a= 3i+j —2kandb=i—j+k calculate a- b. 
(b) Calculate (2i+ j — K) - (3k — 2j). 


(a) a-b=3-1+41-(—-1) +(-2)-1=3-1-2=0. 


(b) (2i+j —k) -(3k— 2j) = (2i+ j — k) - (0i — 2j + 3k) 
=2-0-1-2-1-3=-5. A 


Certain properties of the inner product follow from the definition. If a b, and care 
vectors in R? and «œ and £ are real numbers, then 


(i) a-a>0; 
a-a=0 if and only if a=0 


(ii) «a. b= g(a- b) and a- Bb= g(a. b). 
(iii) a-(b+0 =a-b+a.c and (a+ b)-c=a-c+b-c 
(iv) ab=b-a 


To prove the first of these properties, observe that if a= a,i+ aj +a3k, thena-a= 
a? +a? +a3. Because aj, az, and a3 are real numbers, we know a? > 0, a? > 0, af > 0. 
Thus, a-a > 0. Moreover, if a? + a3 + a? = 0, then a, = a = a3 = 0; therefore, 
a= 0 (zero vector). The proofs of the other properties of the inner product are also 
easily obtained. 

It follows from the Pythagorean theorem that the length of the vector a= a,i+ a2j+ 
azkis \/at + aż + a3 (see Figure 1.2.2). The length of the vector ais denoted by ||all. 
This quantity is often called the norm of a Because a- a= a? + a? + a2, it follows that 


lal = (a-a. 


N 


J 2 2 2 
ai + ay + a3 


figure 1.2.2 The length of the vector 
a = (a1, @, As) is given by the Pythagorean 


formula: a + ae + a. 


example 2 


solution 


figure 1.2.3 The vectors a, b, 
and ¢ are represented by the 
sides of an equilateral triangle. 


sin @ 


cos 8 


figure 1.2.4 The coordinates of 
i, are cos 0 and sin 9; it is a unit 
vector because 

cos’ 6 + sin? 6 = 1. 


figure 1.2.5 The distance 
between the tips of a and b is 
| b-a |. 
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Unit Vectors 


Vectors with norm 1 are called unit vectors. For example, the vectors i, j, K are unit 
vectors. Observe that for any nonzero vector ; when we divide 
aby |jal], we say that we have normalized a 


(a) Normalize v= 2i+ 3j — 5k 
(b) Find unit vectors a b, and cin the plane such that b+ c= a 


(a) We have ||wi| = \/22 + 32 + (1/2)? = (1/2)/53, so the normalization of vis 


1 4, 6. 1 
u= = 
(IMI 


=m m 5B 


(b) Because all three vectors are to have length 1, a triangle with sides a b, and c 
must be equilateral, as in Figure 1.2.3. If we orient the triangle as in the figure and 
we take a = i, then necessarily 


balie j a alit 
=y. = GN ae 


Note that indeed |jal| = |b] = |c| = 1 and tht b+ c=a A 


Inthe plane, define the vector ig = (cos @)i+ (sin@)j, whichis the unit vector making 
an angle 6 with the x axis (see Figure 1.2.4). 


Distance 


If aand bare vectors, we have seen that the vector b— ais parallel to and has the same 
magnitude as the directed line segment from the endpoint of ato the endpoint of b 
It follows that the distance from the endpoint of ato the endpoint of bis ||b— al] (see 
Figure 1.2.5). 


Inner Product, Length, and Distance Letting a= aÏ + aj + a3K and 
b= bii + bj + b3k, their inner product is 


a- b= abı + arb + a3b3, 


while the length of ais 


laj = /a-a= ya? +a} + a}. 


To normalize a vector a form the vector 
a 
lial 


The distance between the endpoints of aand bis ||a—bj|, and the distance between 
P and Q is PĜ. 
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solution 


Find the distance from the endpoint of the vector i, that is, the point (1, 0, 0), to the 
endpoint of the vector j, that is, the point (0, 1, 0). 


Ij- il = “0-1? + (1-0/7 + (0-0 = V2. 4 


The Angle Between Two Vectors 


Let us now show that the inner product does indeed measure the angle between two 
vectors. 


Theorem | Let aand bbe two vectors in R? and let 6, where 0 < 0 < x, be 
the angle between them (Figure 1.2.6). Then 


a- b= |lal|||bl| cos 6. 


It follows from the equation a- b = ||aliiıbj; cos@ that if aand bare nonzero, we 
may express the angle between them as 


a-b 
0 = cos- ( } 
|| aj il bli 


proof If we apply the law of cosines from trigonometry to the triangle with one 
vertex at the origin and adjacent sides determined by the vectors a and b (as in the 
figure), it follows that 


Ib- all? = al? + ||? — 2al ibj] cos 8. 


Because ||b— all? = (b— a) -(b— a, jja? = a- a and ||b|? = b- b, we can rewrite 
the preceding equation as 


(b-a -(b— a = a-a+ b- b- 2jjajibj| cose. 


figure 1.2.6 The vectors a, b, and the angle 6 
between them; the geometry for Theorem 1 and its 
proof. 
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We can also expand (b— a) - (b— a) as follows: 

(b— a)-(b— a = b- (b-a — a-(b— a) 
=b-b—b-a—a-b+a-a 
=a-a+b-b-2a-b 

Thus, 
a-a+ b.b- 2a-b=a-a+b- b— 2|/al|||b|| cos. 
That is, 


a- b= |jali|bj cos6. m 


example 4 | Find the angle between the vectors i+ j + kand i+ j — k (see Figure 1.2.7). 


figure 1.2.7 Finding the angle between 
a=i+j+kond b=i+j-k. 


solution Using Theorem 1, we have 


(i+j+k)-(i+j—k =|i+j+kili+j— kj cose, 


and so 
1+1—1=(¥V3)(V3) cosé. 
Hence, 
coso = į. 
That is, 


@ = cos™}(4) ~ 1.23 radians (71°). A 


The Cauchy-Schwarz Inequality 


Theorem 1 showsthatthe inner productof two vectors isthe productof their lengths times 
the cosine of the angle between them. This relationship is often of value in problems of 
a geometric nature. An important consequence of Theorem 1 is: 
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example 6 


example 7 


Corollary Cauchy-Schwarz Inequality For any two vectors aand b, we 
have 


|a- bj < |\al|||b| 


with equality if and only if ais a scalar multiple of b, or one of them is O 


proof If aisnotascalar multiple of b, then 6, the angle between them, is not zero or 
gr, and so |cos @| < 1, and thus the inequality holds; in fact, if aand bare both nonzero, 
strict inequality holds in this case. When ais a scalar multiple of b, then @ = 0 or x 
and |cosé@| = 1, so equality holds in this case. E 


Verify the Cauchy- Schwarz inequality for a= —i + j + kand b= 3i + k. 


The dot productis a- b= -3+ 0+1 = —2,50 ja; b| = 2. Also, jaj = v1 +1 +1 = 
v3 and ||bl| = /9 +1 = V10, and it is true that 2 < /3- 4/10 because v3 - VTO > 
/3-/3=3>2. A 


If aand bare nonzero vectors in R? and 8 is the angle between them, we see that 
a: b= Oif and only if cos@ = 0. Thus, the inner product of two nonzero vectors is zero 
if and only if the vectors are perpendicular. Hence, the inner product provides us with 
a convenient method for determining whether two vectors are perpendicular. Often we 
say that perpendicular vectors are orthogonal. T he standard basis vectors i, j, and kare 
mutually orthogonal and of length 1; any such system is called orthonormal. We shall 
adopt the convention that the zero vector is orthogonal to all vectors. 


The vectors ij = (cos@)i+ (sin@)j and jg = —(sin@)i + (cos@)j are orthogonal, 
because 


ip -jọ = — cos 0 sin 0 + sin 8 cos = 0. 


Here, i, is the rotation of i, 6° counterclockwise. Also, jẹ is the rotation of j, 0° 
counterclockwise (see Figure 1.2.8). 


figure 1.2.8 The vectors ig and jg are orthogonal and of 
unit length, that is, they are orthonormal. 


Let aand bbe two nonzero orthogonal vectors. If cis a vector in the plane spanned by 
aand b, then there are scalars œ and £ such that c= wa+ £b. Use the inner product 
to determine a and £ (see Figure 1.2.9). 
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a 
_— 


figure 1.2.9 The geometry for finding a and £, where 
c=aa+ fb. 


x 


solution Taking the inner product of aand c, we have 
a-C=a-(aa+ Bb) = aa-a+ ga. b. 


Because aand bare orthogonal, a- b= 0, and so 


ac ac 
aa |a 
Similarly, 


Orthogonal Projection 


In the preceding example, the vector «ais called the projection of calong a and Bbis 
its projection along b. L et us formulate this idea more generally. If wis a vector, and / is 
the line through the origin in the direction of a vector a, then the orthogonal projection 
of von ais the vector pwhose tip is obtained by dropping a perpendicular line to / from 
the tip of v, asin Figure 1.2.10. 

Referring to the figure, we see that pis a multiple of aand that vis the sum of pand 
a vector q perpendicular to a Thus, 


v=ca+q 


where p= caand a- q = 0. Taking the dot product of awith both sides of v = ca+ q 
we find a- v= ca- a so c = (a- v) /(a- a), and hence 


figure 1.2.10 p is the orthogonal a-v 


projection of v on a. p= ial? 
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solution 


Orthogonal 
projection 
of vona 


figure 1.2.11 The 
orthogonal projection of v 
on a equals —ta. 


The length of pis 


|a: v| |a: v| 
= —— |a| = —— = |ivi| cos ð. 
II Pll jal? || al jal Iivi 


Orthogonal Projection The orthogonal projection of von ais the vector 
62i? 


Find the orthogonal projection of i+ j on i — 2j. 
With a= i — 2j and v= i + j, the orthogonal projection of von ais 


a- v _1=2 
aa 1+4 


ee | ne 
(i— 2j) = zli 2j) 


(see Figure 1.2.11). A 


The Triangle Inequality 


A useful consequence of the Cauchy-Schwarz inequality, which is called the triangle 
inequality, relates the lengths of vectors aand band of their sum a+ b Geometrically, 
the triangle inequality says that the length of any side of a triangle is no greater than the 
sum of the lengths of the other two sides (see Figure 1.2.12). 


Theorem 2 Triangle Inequality For vectors aand bin space, 


la+ bj < lial + Ibl). 


proof Whilethisresultmay beclear geometrically, itis useful to give a proof using the 
Cauchy- Schwarz inequality, as it will generalize to n-dimensional vectors. We consider 
the square of the left-hand side: 


ja+ bi? = (a+ b)- (a+ b) = |a? + 2a- b+ ||bi*. 


figure 1.2.12 This geometry shows that 
IOQII < IORI] + || RQ] or, in vector notation, 
that || a + b|| < || all + || bl], which is the 
triangle inequality. 
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solution 


u-W 
w 


figure 1.2.13 Illustrating 
the inequality || u — v || < 
lu-w]+iw-v Il. 


example 10 


solution 
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By the Cauchy- Schwarz inequality, we have 
lal? + 2a- b+ || bl? < ial? + 2/al||| bl + |b)? = (lall + || by)’. 
Thus, 
la+ bl? < (jall + |b)’; 


taking square roots proves the result. E 


~ 
w 
— 


Verify the triangle inequality fora=i+jandb=2i+j+k 


kem 


Prove that ||u— vj < |u— wi + ||w— vj for any vectors u, v, and w. Illustrate 
with a figure in which u, v, and whave the same base point. 


We have a+ b= 3i + 2j + k, so |ja+ b|| = V9 + 4+ I = v14. On the other 
hand, jall = v2 and ||bl| = 6, so the triangle inequality asserts that 

v14 < /2 + v6. The numbers bear this out: V14 ~ 3.74, while 

J2 + /6 ~ 1.41 + 2.45 = 3.86. 


We find that u— v= (u — w + (w-— v), so the result follows from the triangle 
inequality with areplaced by u— wand breplaced by w— v. Geometrically, we 
are considering the shaded triangle in Figure 1.2.13. A 


~ 
w 
— 


zZ 


Physical Applications of Vectors 


A simple example of a physical quantity represented by a vector is a displacement. 
Suppose that, on a part of the earth's surface small enough to be considered flat, we 
introduce coordinates so that the x axis points east, the y axis points north, and the 
unit of length is the kilometer. If we are at a point P and wish to get to a point Q, the 
displacement vector d = PQ joining P to Q tells us the direction and distance we have 
to travel. If x and y are the components of this vector, the displacement of P to Q is “x 
kilometers east, y kilometers north.” 


Suppose that two navigators who cannot see one another but can communicate by radio 
wish to determine the relative position of their ships. Explain how they can do this if 
they can each determine their displacement vector to the same lighthouse. 


Let Pı and P2 be the positions of the ships, and let Q be the position of the lighthouse. 
The displacement of the lighthouse from the ith ship is the vector q joining P; to Q. 
The displacement of the second ship from the first is the vector djoining Pı to P2. We 
have d+ œ = q (Figure 1.2.14), and so d= d — œ. That is, the displacement from 
one ship to the other is the difference between the displacements from the ships to the 
lighthouse. 


figure 1.2.14 Vector methods can be used 
to locate objects. A 
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Position after 1 h 


Initial 10 
position 2 

figure 1.2.15 If an 
object moves northeast 


at 10 km/h, its velocity 
vector has components 


(10/4/2, 10/4/2) = 
1001/4/2, 174/2. where 
(/y/2, 1/42) is the unit 


vector of the northeast 
direction. 


example 11] 


solution 


We can also represent the velocity of a moving object as a vector. For the moment, 
we will consider only objects moving at uniform speed along straight lines. Suppose, 
for example, that a boat is steaming across a lake at 10 kilometers per hour (km/h) in 
the northeast direction. After 1 hour of travel, the displacement is (10/./2, 10/./2) ~ 
(7.07, 7.07); see Figure 1.2.15. 

The vector whose components are (10/./2, 10/./2) is called the velocity vector of 
the boat. In general, if an object is moving uniformly along a straight line, its velocity 
vector is the displacement vector from the position at any moment to the position 1 unit 
of time later. |f a current appears on the lake, moving due eastward at 2 km/h, and the 
boat continues to point in the same direction with its engine running at the same rate, 
its displacement after 1 hour will have components given by (10/./2 + 2, 10/./2); see 
Figure 1.2.16. The new velocity vector, therefore, has components (10/./2+2, 10/./2). 
We note that this is the sum of the original velocity vector (10/./2, 10/./2) of the boat 
and the velocity vector (2, 0) of the current. 


Displacement and Velocity |f an object has a (constant) velocity vector 
v, then in z units of time the resulting displacement vector of the object is d = rv, 
thus, after time £ = 1, the displacement vector equals the velocity vector. See 
Figure 1.2.17. 


Displacement due 
to current 
Displacement .————>* 
due to 
engine 


Total displacement Displacement in time t 


figure 1.2.16 The total displacement figure 1.2.17 Displacement = 
isthe sum of the displacements due time x velocity. 
to the engine and the current. 


A bird is flying in a straight line with velocity vector 10i + 6j + k (in kilometers per 
hour). Suppose that (x, y) are its coordinates on the ground and z is its height above the 
ground. 


(a) If the bird is at position (1, 2, 3) at a certain moment, what is its location 1 hour 
later? 1 minute later? 


(b) How many seconds does it take the bird to climb 10 meters? 


(a) The displacement vector from (1, 2, 3) after 1 hour is given by 10i + 6j + k, so 
the new position is (1, 2, 3) + (10, 6, 1) = (11, 8, 4). After 1 minute, the displacement 
vector from (1, 2, 3) is (10i + 6j + k) = zi+ $j + Kk, and so the new position is 
(1, 2,3) + (2, a ay) = (i HS). 

(b) After ż seconds (= r/3600 hours), the displacement vector from (1, 2, 3) is (t/3600) 
(10i + 6j + k) = (r/360)i + (t/600)j + (t/3600)k The increase in altitude is the z 
component, namely, r/3600. This will equal 10 m (= 15 km) when ¢/3600 = T that 
is, whent=365. A 
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example 12 | Physical forces have magnitude and direction and may thus be represented by vectors. 
If several forces act at once on an object, the resultant force is represented by the sum 
of the individual force vectors. Suppose that forces i+ Kand j + Kare acting on a body. 
W hat third force F must we impose to counteract the two— that is, to make the total 
force equal to zero? 


solution | Theforce F should be chosen so that (i+ k) + (j+k) +F = Q thatis, F = —(i+ k) — 
(j+ k) = —i — j — 2k. (Recall that Ois the zero vector, the vector whose components 
are all zero.) A 


exercises 


L Calculate (3i+ 2j +k) - (i+ 2j—k). 4. Compute u- v, where u = /3i — 315j + 22kand 
v=u/|Uul. 
2. Calculate a- b, where a = 2i + 10j — 12kand 
b= —3i + 4k 5. Is ||8i— 12kl|| - 6j + ki] — |(8i — 12k) - (6j + k)| equal 
to zero? Explain. 
3. Find the angle between 7j + 19kand —2i — j (to the 
nearest degree). 


In Exercises 6 to 11, compute ||U||, ||V\|, and U- V for the given vectors in R3. 


G u= 15i- 2j+ 4k, v= ri + 3j- k 16. Normalize the vectors in Exercises 6 to 8. (Only the 
solution corresponding to Exercise 7 is in the Student 
7. u=2j-i,v=-j+i Guide.) 


u=5i-j+2kv=i+j—k 17. Find the angle between the vectors in Exercises a 11. 
If necessary, express your answer in terms of cos~*. 

u= -i+ 3j+k v= -2i - 3j - 7k 18. Find all values of x such that (x, 1, x) and (x, —6, 1) are 

orthogonal. 


19. Find all values of x such that (7, x, —10) and (3, x, x) 


8. 

9. 

10. u= —i + 3k, v= 4j 
IL u= -i+ 2j- 3k v= —i — 3j + 4k are orthogonal. 

12. 20. Find the projection of u= —i + j + konto 


= 2 j i 
Let v= (2, 3). Suppose we R4 is perpendicular to v, v= 2i+j —3k 


and that || wi| = 5. This determines wup to sign. Find 


one such w 2L Find the projection of v= 2i + j — 3konto 
u=-i+j+k 
13. Find b and c so that (5, b, c) is orthogonal to both (1, 2, 
3) and (1, —2, 1). 22. What restrictions must be made on the scalar b so that 
the vector 21+ bj is orthogonal to (a) —3i+ 2j + kand 
14 Letv = (0, 3, 0), w = (2, 2, 0), v3 = (1, 1, 3). These (b) K 
three vectors with their tails at the origin determine a 
parallelepiped P. 23. Vectors vand ware sides of an equilateral triangle 
whose sides have length 1. Compute v- w 
(a) Draw P. 


, o 24. Let b= (3, 1, 1) and P be the plane through the origin 
(b) Determine the length of the main diagonal (from the given by x + y + 2z = 0. 


origin to its opposite vertex). 
; ; : (a) Find an orthogonal basis for P. That is, find two 
15. w hat is the geometric relation between the vectors wand nonzero orthogonal vectors vı, w € P. 
wif v- w= —||vi| |w? 
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(b) Find the orthogonal projection of bonto P. That is, 
find Projy, b+ Projw, b. 


Find two nonparallel vectors both orthogonal to (1, 1, 1). 


Find the line through (3, 1, —2) that intersects and is 
perpendicular to the line 
x=-l+t,y=—2+1,z2=—1+1. [HINT: If 
(xo, yo, zo) is the point of intersection, find its 
coordinates. ] 


«a Using the dot product, prove the converse of the 


Pythagorean theorem. That is, show that if the lengths of 
the sides of a triangle satisfy a? + b? = c2, then the 
triangle is a right triangle. 


For v= (v1, v2, v3) leta, B, y denote the angles 
between wand the x, y, and z axes, respectively. Show 
that cos? æ + cos? 8 + cos? y = 1. 


A ship at position (1, 0) on a nautical chart (with north 
in the positive y direction) sights a rock at position (2, 
4). What is the vector joining the ship to the rock? W hat 
angle 6 does this vector make with due north? (This is 
called the bearing of the rock from the ship.) 


Suppose that the ship in Exercise 29 is pointing due 
north and traveling at a speed of 4 knots relative to the 
water. There is a current flowing due east at 1 knot. The 
units on the chart are nautical miles; 1 knot = 1 nautical 
mile per hour. 


(a) If there were no current, what vector uwould 
represent the velocity of the ship relative to the sea 
bottom? 


(b) If the ship were just drifting with the current, what 
vector wwould represent its velocity relative to the sea 
bottom? 


(c) What vector wrepresents the total velocity of the 
ship? 


(d) Where would the ship be after 1 hour? 
(e) Should the captain change course? 
(f) What if the rock were an iceberg? 


An airplane is located at position (3, 4, 5) at noon and 
traveling with velocity 400i + 500j — k kilometers per 
hour. The pilot spots an airport at position (23, 29, 0). 


(a) At what time will the plane pass directly over the 
airport? (Assume that the plane is flying over flat ground 
and that the vector k points straight up.) 


(b) How high above the airport will the plane be when it 
passes? 


32. The wind velocity wy is 40 miles per hour (mi/h) from 


east to west while an airplane travels with air speed w of 
100 mi/h due north. The speed of the airplane relative to 
the ground is the vector sum vı + Vp. 


(a) Find vı + w2. 


(b) Draw a figure to scale. 


33. A force of 50 Ib is directed 50° above horizontal, 


pointing to the right. Determine its horizontal and 
vertical components. Display all results in a figure. 


34. Two persons pull horizontally on ropes attached to a 


post, the angle between the ropes being 60°. Person A 
pulls with a force of 150 Ib, while person B pulls with a 
force of 110 Ib. 


(a) The resultant force is the vector sum of the two 
forces. Draw a figure to scale that graphically represents 
the three forces. 


(b) Using trigonometry, determine formulas for the 
vector components of the two forces in a conveniently 
chosen coordinate system. Perform the algebraic 
addition, and find the angle the resultant force makes 
withA. 


35. A 1-kilogram (1-kg) mass located at the origin is 


suspended by ropes attached to the two points (1, 1, 1) 
and (—1, —1, 1). If the force of gravity is pointing in the 
direction of the vector —k, what is the vector describing 
the force along each rope? [HINT: Use the symmetry of 
the problem. A 1-kg mass weighs 9.8 newtons (N).] 


Suppose that an object moving in direction i + j is acted 
on by a force given by the vector 2i + j. Express this 
force as the sum of a force in the direction of motion and 
a force perpendicular to the direction of motion. 


37. A force of 6 N makes an angle of x /4 radian with the y 


axis, pointing to the right. The force acts against the 
movement of an object along the straight line connecting 
(1, 2) to (5, 4). 


(a) Find a formula for the force vector F. 


(b) Find the angle 6 between the displacement direction 
D = (5 — 1)i + (4 — 2)j and the force direction F. 


(c) The work done is F - D, or, equivalently, 
|| F|| || D|] cos @. Compute the work from both formulas 
and compare. 
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38. Show that in any parallelogram the sum of the squares of 39. Using vectors, show that the diagonals of a rectangle are 
the lengths of the four sides equals the sum of the perpendicular if and only if the rectangle is a square. 
squares of the lengths of the two diagonals. 


1.3 Matrices, Determinants, and the Cross Product 


example |] | 


In Section 1.2 we defined a product of vectors that was a scalar. In this section we shall 
define a product of vectors that is a vector; that is, we shall show how, given two vectors 
aand b, we can produce a third vector a x b, called the cross product of aand b. This 
new vector will have the pleasing geometric property that it is perpendicular to the plane 
spanned (determined) by aand b. The definition of the cross product is based on the 
notions of the matrix and the determinant, and so these are developed first. Once this 
has been accomplished, we can study the geometric implications of the mathematical 
structure we have built. 


2 x 2 Matrices 
We define a2 x 2 matrix to be an array 


a11 412 
an az |" 
where a11, a12, a21, aNd az are four scalars. For example, 
2 1 -1 0 ana 13 (7 
0 4)’ T. dy’ 6 11 
are 2 x 2 matrices. The determinant 


411 412 
421 422 


of such a matrix is the real number defined by the equation 


aun 42) = ayan — anan. (1) 
ay an 

Lä Wia 5 6l 7 

F j|=1-1=0 ; | =4-6= 2: E e| = 40-42 = -2 


3 x 3 Matrices 
A 3 x 3 matrix is an array 
41 412 413 


a21 422) 423), 
431 432 433 
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where, again, each a;; is a scalar; a;; denotes the entry in the array that is in the ¿th row 
and the jth column. We define the determinant of a 3 x 3 matrix by the rule 


411 412 413 
a21 42 423| = 411 
431 432 433 


a21 422 
431 432 


a22 423 
432 433 


; (2) 


Without some mnemonic device, formula (2) would be difficult to memorize. The rule 
to learn is that you move along the first row, multiplying a1; by the determinant of the 
2 x 2 matrix obtained by canceling out the first row and the jth column, and then you 
add these up, remembering to put a minus in front of the a2 term. For example, the 
determinant multiplied by the middle term of formula (2), namely, 


421 423 
431 433 


is obtained by crossing out the first row and the second column of the given 3 x 3 matrix: 


100 
1 ol lo o| lo 1 
0 1 o=1| |-o] [+0] =i 
a a mt a a 
L25 
as eik aor S458 S-ar pian 
8 97217 9 +317 8 
789 m 


Properties of Determinants 


An important property of determinants is that interchanging two rows or two columns 
results in a change of sign. For 2 x 2 determinants, this is a consequence of the definition 
as follows: For rows, we have 


Ge E ananz — anay = —(anan — anan) = — a 
a21) a22 ail an 
and for columns, 
ail anj _ |an an 
= —(a12a71 — anan) = — ' 
a21 422 a22 421 


We leave it to you to verify this property for the 3 x 3 case. 

A second fundamental property of determinants is that we can factor scalars out of 
any row or column. For 2 x 2 determinants, this means 
aay an 


aj} Adj? 411 412 


421 422 


dai, ddaj2 
a21 a22 


a1 a2 
Qa21 Qa 


Q@a21 422 421 a2 
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Similarly, for 3 x 3 determinants we have 


aj, Qan Q43 411 412 413 a1 œa 443 
a21 a22 423) = Q |421] 4222 4233| = |21 8a22 473), 
431 432 433 431 432 433 431 &4a32 433 


and so on. These results follow from the definitions. In particular, if any row or column 
consists of zeros, then the value of the determinant is zero. 

A third fundamental fact about determinants is the following: Zf we change a row 
(or column) by adding another row (or, respectively, column) to it, the value of the 
determinant remains the same. For the 2 x 2 case, this means that 


a a|_|a+bi a+b _ | a az 
by b| bı b2 ~ |by) +a, b +a 
_ a+a2 a la a, + a2 
bi +b2 b bı by +b) 


For the 3 x 3 case, this means 
a a2 G3 a+b, a+b, a3+b3 a+42 a2 a3 


by bp b3h = bı bz b3 = |b +b: bz b3, 
C1 C2. C3 Ci C2 C3 CLTC C2 63 


and so on. Again, this property can be proved using the definition of the determinant. 


Suppose 
a=ab+ fc that is, a= (a1, az, a3) = a(bı, b2, b3) + Blcr, c2, c3). 


Show that 


a, a a3 
bi b b; =0. 
C1 C2 C3 


We shall prove the case œ + 0, 8 # 0. The case œ = 0 = £ is trivial, and the case 
where exactly one of a, 6 is zero is a simple modification of the case we prove. Using 
the fundamental properties of determinants, the determinant in question is 


ab,+ cı æbı+ c2 ab3+ Bo3 
by b2 b3 


Cl C2 C3 


1 |e + Ber aby + Bez ob3 + Bes 
= —-—| —gabı —ab? —ab3 
a Cl C? C3 


(factoring —1/a out of the second row) 


ab; —ab? —ab3 
—Bey —Be2 —Pc3 
(factoring —1/£ out of the third row) 


1) ( 3 ab, + Bc, aby+ Bc, ab3 + Bc3 
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1 | Bar po Be 
= ab; ab ab3 (adding the second row to the first row) 
ap —Bc, =e; —Bc3 
1 0 0 0 
= abı ab ab3 (adding the third row to the first row) 
ae —Be, —Be2 —Be3 
=0 A 


Closely related to these properties is the fact that we can expand a 3 x 3 determinant 
along any row or column using the signs in the following checkerboard pattern: 


For instance, you can check that we can expand “by minors” along the middle row: 


a1 an ag an a3 a ag a ay 
a2 422. az| = —a21 + an — az3 
431 432 433 a32 433 a31 433 a31 432 


Let us redo the second determinant in Example 2 using this formula: 


2 3 2 3 1. 3 1 2 
5 6)=—-4 +5 —6 = (—4)(—6) + (5)(12) 
8 9 8 9 7 9 7 8 


+ (—6)(6) = 0. 


~~ Dr 


Determinants appear to have been invented and first used by Leibniz in 1693, in 
connection with solutions of linear equations. Maclaurin and Cramer developed 
their properties between 1729 and 1750; in particular, they showed that the 
solution of the system of equations 


1X1 + Q12X2 + Q13% = by 
Cg) X1 + Q22X2 + Q23% = b2 
C31 X1 + Q32X2 + Q33% = b3 
is 
1 bi a2 a3 ;;an bi a3 1 [a Ae bı 
am b az 3], a9 Qna b gz}, ba Qan Qz by}, 
b3 Q32 @33 Q3 b3 33 Q3) Q32 b3 
where 


Qni Q2 Ag 
A=]|@n A2 3], 
Q31 O32 Q33 


example 4 


solution 
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a fact now known as Cramers rule. While this method is rather inefficient from a 
numerical point of view, it is of theoretical importance in matrix theory. Later, 
Vandermonde (1772) and Cauchy (1812), treating determinants as a separate 
topic worthy of special attention, developed the field more systematically, with 
contributions by Laplace, Jacobi, and others. Formulas for volumes of 
parallelepipeds in terms of determinants are due to Lagrange (1775). We shall 
study these later in this section. Although during the nineteenth century 
mathematicians studied matrices and determinants, the subjects were 
considered to be distinct disciplines. For the full history up to 1900, see T. Muir, The 
Theory of Determinants in the Historical Order of Development (reprinted by 
Dover, New York, 1960). 


Cross Products 


Now that we have established the necessary properties of determinants and discussed 
their history, we are ready to proceed with the cross product of vectors. 


Definition The Cross Product Suppose that a= aii + a2j + a3k and b = 
bii + bj + b3kare vectors in R?. The cross product or vector product of a and 
b, denoted a x b, is defined to be the vector 


_ {42 43)~ |a 3] ay a2 
PO eel ee i al 
or, symbolically, 
i j k 
ax b= Gh (by Calo 
bı bı b3 


Even though we only defined determinants for arrays of real numbers, this formal 
expression involving vectors is a useful memory aid for the cross product. 


Find (3i — j +k) x (i+ 2j — k). 


i j k 
(3i-j+k x(i+2j-k =|3 -1 1)=~-i+4j+7k 
1 2 -l A 


Certain algebraic properties of the cross product follow from the definition. If a b, 
and care vectors and «œ, £, and y are scalars, then 


(i) ax b= -(bx a) 
(ii) ax (Bb+yo0 = (ax b) + y(ax © and (wat Bb) x c= alax d +8(bx o. 


Note that ax a= —(ax aj, by property (i). Thus, ax a= Q In particular, 


ixi=0 jxj=0 kxk=0. 
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Also, 
ixj=k jx k=i1, kxi=j, 


which can be remembered by cyclicly permuting i, j, Klike this: 
CO 
k j 
MA 


To give a geometric interpretation of the cross product, we first introduce the triple 
product. Given three vectors a, b, and c the real number 


(ax b)-c 


is called the triple product of a b, and c(in that order). To obtain a formula for it, let 
a= qi+ aj + ak, b= bii + bj + 53k, and c = cii + c2j + c3K. Then 


_ (|a2 43). ja 43); a, a2 . a r 
(ax b)-c= (; bs i bs j+ hi d k) (cai + cj + c3k) 
_ |@ a3) ., — |4 a3| o p ee a| a 
bo b| lbi bl T lbi b> 


This is the expansion by minors of the third row of the determinant, so 


al a2 @3 
(axb -c= bı bı b3}. 
Či C2 G3 


If cis a vector in the plane spanned by the vectors aand b, then the third row in the 
determinant expression for (a x b) -cis a linear combination of the first and second 
rows, and therefore (a x b) -c= 0. In other words, the vector a x bis orthogonal to 
any vector in the plane spanned by a and b, in particular to both aand b. 

Next, we calculate the length of a x b. Note that 


2 2 
ay, a3 a, a2 


bı b3 bı b 
= (ab3 — a3b2)? + (aib3 — b1a3)}? + (aib2 — byaz)?. 


2 
a a3 


2L 
lax bl = bz b3 


If we expand the terms in the last expression, we can recollect them to give 
(aj +a? +.a3)(bj +b} + b3) — (abi + azb2 + a3b3)’, 
which equals 
lal? llb? — (a- b)? = lal? || b|? — Ilall? Ibl? cos? @ = |Jall?|| |)? sin’ 9, 


where @ is the angle between a and b,0 < @ < x. Taking square roots and using 
Vk? = |k|, we find that |a x bl] = |/al|||bi||sin 6|. 

Combining our results, we conclude that ax bis a vector perpendicular to the plane 
P spanned by aand b with length |\al|||[b\||sin@|. We see from Figure 1.3.1 that this 
length is also the area of the parallelogram (with base ||al| and height ||bsin @||) spanned 
by aand b There are still two possible vectors that satisfy these conditions because 
there are two choices of direction that are perpendicular (or normal) to P. This is clear 


-s 


figure 1.3.2 The 
right-hand rule for 
determining in which of 
the two possible 
directions a x b points. 
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AN 
i figure 1.3.1 nı and nz are the two possible vectors 
bl| sin 6 orthogonal to both a and b, and with norm 
P EP Ia || | b || [sind]. 


from Figure 1.3.1, which shows the two choices m and —m perpendicular to P, with 


[||| = -m| = lallblsin 6). 


Which vector represents a x b, n, or —m? The answer is m. Try a few cases such 
as k = i x j to verify this. The following “right-hand rule” determines the direction of 
ax bin general. Take your right hand and place it so your fingers curl from atoward b 
through the acute angle 6, as in Figure 1.3.2. Then your thumb points in the direction 


ofax b. 


The Cross Product Geometric definition: ax bis the vector such that: 


(1) ||ax bl = |/al|||bl| sin 6, the area of the parallelogram spanned by aand b 
(8 is the angle between aand b; 0 < 0 < x); see Figure 1.3.3. 


(2) ax bis perpendicular to aand b, and the triple (a b, ax b) obeys the 
right-hand rule. 


Component formula: 
i j k 
(aii + aj + a3k) x (biñ+ b2) + bsk) = ja] a a3 
bı b bs 


= (azb; — a3b2)i — (aıb3 — a3b1)j + (aıb2 — a2b1) K. 


Algebraic rules: 


1. ax b= Oif and only if aand bare parallel or aor bis zero. 


. axb=-bxa 


. (a+b xc=axc+bxc 


2 
3. ax(b+0o =axb+axc 
4 
5. (wa) x b= a(ax b). 


Multiplication table: 


Second factor 


x i j k 
i (0) k -j 
First j| -k (0) i 


factor k j —İ (0) 
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example 5 


solution 


example 6 


solution 


example 7 


solution 


figure 1.3.3 The length of a x bis the 
area of the parallelogram formed by 
aand b. 


Find the area of the parallelogram spanned by the two vectors a= i + 2j + 3kand 
b= -i-k 


We calculate the cross product of aand b by applying the component or determinant 
formula, with a; = 1, a2 = 2, a3 = 3, bı = —1, bp = 0, b; = —1: 


ax b= [(2)(—1) — (3)(0)]i+ [(3)(—1) — (1)(—1)]j + [(1)(0) — (2)(—1)]k 
= -2i — 2j + 2k. 


Thus, the area is 


lax bl = /(—2)2 + (-2)2 + (2)? = 2V3. 4 


Find a unit vector orthogonal to the vectors i+ j and j + k. 


A vector perpendicular to both i+ j and j + kis their cross product, namely, the vector 


ij k 
(i+j)x(j+k=]1 1 0}=i-j+k 
0 1 1 
Because ||i — j + k| = /3, the vector 
Losa 
—(i-j+k 


V3 


is a unit vector perpendiculartoi+jandj+k A 


Derive an identity relating the dot and cross products from the formulas 
lux vi = jjuliivising and u-v= |jul|||vi cose 
by eliminating 8. 


Seeing sin@ and cosé multiplied by the same expression suggests squaring the two 
formulas and adding the results. We get 


Ju x vij? + (u-v)? = juli? lvii? (sin? 6 + cos? 4) = Jul? vi, 
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So 
lu x vil? = luj? ivi? — (ue vy’. 


This identity is interesting because it establishes a link between the dot and cross 
products. A 


Geometry of Determinants 


Using the cross product, we may obtain a basic geometric interpretation of 2 x 2 and 
3 x 3 determinants. Let a= a1Ì + aj and b= bi + bj be two vectors in the plane. If 
6 is the angle between aand b, we have seen that |a x bi] = |/all||Ib|||sin @| is the area 
of the parallelogram with adjacent sides aand b. The cross product as a determinant is 


i j k a & 
ax b= /aq, a O;= k. 
bi b of ibi b 


Thus, the area |ja x bj| is the absolute value of the determinant 


a, a2 


= aib — abı. 
A b> 192 201 


Geometry of 2x2 Determinants The absolute value of the determinant 
ay a2 


bi z is the area of the parallelogram whose adjacent sides are the vectors a = 


ai + aj and b= bii + b2). The sign of the determinant is + when, rotating in 
the counterclockwise direction, the angle from ato bis less than x. 


example 8 | Find the area of the triangle with vertices at the points (1, 1), (0, 2), and (3, 2) (see 
Figure 1.3.4). 


figure 1.3.4 (a) Find the area Aof the shaded triangle by expressing the sides as 
vector differences (b) to get A= || (b — a) x (© — a) ||/2. 
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solution 


Letta=i+j,b= 2j, and c = 3i+ Qj. It is clear that the triangle whose vertices 
are the endpoints of the vectors a b, and c has the same area as the triangle with 
vertices at 0. b— a and c— a(Figure 1.3.4). Indeed, the latter is merely a translation 
of the former triangle. B ecause the area of this translated triangle is one-half the area 
of the parallelogram with adjacent sides b— a= —i + j, and c— a= 2i + j, we 
find that the area of the triangle with vertices (1, 1), (0, 2), and (3, 2) is the absolute 
value of 


thatis, 3/2. A 


There is an interpretation of determinants of 3 x 3 matrices as volumes that is 
analogous to the interpretation of determinants of 2 x 2 matrices as areas. 


Geometry of 3 x 3 Determinants The absolute value of the determinant 


a a a3 
D= ln O Ba 


is the volume of the parallelepiped whose adjacent sides are the vectors 


a= ail + aaj + a3k, b= bii + b2j + b3k, and c= ci+ cj + ck. 


To prove the statement in the preceding box, we refer to Figure 1.3.5 and note that 
the length of the cross product, namely, |a x bjj, is the area of the parallelogram with 
adjacent sides aand b. M oreover, (ax b) -c= ||ax blji|c|| cos y, where w is the angle 
that c makes with the normal to the plane spanned by aand b Because the volume of 
the parallelepiped with adjacent sides a b, and cis the product of the area of the base 
jax bjj and the altitude ||c||| cos y|, it follows that the volume is |(a x b) -c)|. We saw 
earlier that (a x b) -c= D, so the volume equals the absolute value of D. 


figure 1.3.5 The volume of the 
parallelepiped spanned by a, b, ¢ is 
the absolute value of the 
determinant of the 3 x 3 matrix 
having a, b, ¢ as its rows. 


> y 


example 9 


solution 
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Find the volume of the parallelepiped spanned by the three vectors i + 3k, 2i +j — 2k, 
and 5i + 4k. 


The volume is the absolute value of 


10 3 
2 1 -2|. 
5 0 4 


If we expand this determinant by minors by going down the second column, the only 
nonzero term is 


1 3 
5 4 


a sali 


so the volume equals 11. A 


Equations of Planes 


Let P be a plane in space, Po = (xo, yo, zo) a point on that plane, and suppose that 
n= Ai+ Bj + Ckis a vector normal to that plane (see Figure 1.3.6). Let P = (x, y, z) 
be a point in R?. Then P lies on the plane P if and only if the vector PoP = (x —xo)i+ 
(y — yo)j + (z — zo) Kis perpendicular to n, that is, PoP -n= 0, or, equivalently, 


(Ai+ Bj + Ck) - [(x — xo)i+ (y — yo)j + (z — zo) kK] = 0. 
Thus, 


A(x — xo) + Bly — yo) + C(z — zo) = 0. 


Equation of a Plane in Space The equation of the plane P through 
(xo, yo, Zo) that has anormal vector n= AÌ + Bj + Ckis 


that is, (x, y, z) € P if and only if 
Ax +By+Cz+D=0, 


where D = — Axo — Byo — Czo. 


figure 1.3.6 The points P of the plane through Pp and 
perpendicular to n satisfy the equation PoP- n= 0. 
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example 10 


solution 


example 11] 


solution 


The four numbers A, B, C, and D are not determined uniquely by the plane P. To see 
this, note that (x, y, z) satisfies the equation Ax + By + Cz + D = 0 if and only if it 
also satisfies the relation 


(AA)x + (AB)y + (AC)z + (AD) = 0 


for any constant à + 0. Furthermore, if A, B, C, D and A’, B’, C’, D’ determine the 
same plane P, then A = AA’, B = AB’,C = AC’, D = AD’ for a scalar A. Conse- 
quently, A, B, C, D are determined by P up to a scalar multiple. 


Determine an equation for the plane that is perpendicular to the vector i + j + kand 
contains the point (1, 0, 0). 


Using the general form A(x — xo) + Bly — yo) + C(z — zo) = 0, the plane is 
(ix —1) + 1(y —0) +1(z—0) =0;thatis x+ y+z=1. A4 


Find an equation for the plane containing the three points (1, 1, 1), (2, 0, 0), and 
(1, 1, 0). 


Method 1. This is a “brute force” method that you can use if you have forgotten the 
vector methods. The equation for any plane is of the form Ax + By +Cz+D=0. 
Because the points (1, 1, 1), (2, 0, 0), and (1, 1, 0) lie in the plane, we have 


A+B+C+D=0, 
2A +D=0, 
A+B +D=0. 


Proceeding by elimination, we reduce this system of equations to the form 


2A+D=0 (second equation) 
2B+D=0 (2 x third — second), 
C=0 (first — third). 


Because the numbers A, B, C, and D are determined only up to a scalar multiple, we 
can fix the value of one of them, say A = 1, and then the others will be determined 
uniquely. We get A = 1, D = —2, B = 1, C = 0. Thus, an equation of the plane that 
contains the given points is x + y — 2 = 0. 


Method 2. Let P = (1,1,1),Q = (2,0,0), R = (1, 1, 0). Any vector normal to the 
plane must be orthogonal to the vectors QP and RÊ, which are parallel to the plane, 
because their endpoints lie on the plane. Thus, n = QP x RP is normal to the plane. 
Computing the cross product, we have 


, 0) lies on the plane, we conclude that the equation is given by 
-(z—0) =O;thatissx+y—-2=0. A 


Because the point (2, 0 
(x —2)+(y—0)+0 

Two planes are called parallel when their normal vectors are parallel. Thus, the 
planes Aix + Bry + Cız + Dı = 0 and Ax + Boy + Coz + D2 = 0 are parallel 
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when m = A+ Bij + Cık and m = A2i + Boj + C2k are parallel; that is, 
n, = om for a constant o. For example, the planes 


x—2y+z=0 and —2x + 4y — 2z = 10 
are parallel, but the planes 
x—2y+z= and 2x—2y+z=10 


are not parallel. 


Distance: Point to Plane 


L et us now determine the distance from a point E = (x1, yı, z1) to the plane P described 
by the equation A(x — xo) + Bly — yo) + C(z — zo) = Ax + By + Cz + D = 0. To 
do so, consider the unit normal vector 
ma Ai+ Bj + Ck 
AZ + BE EC? 
which is a unit vector normal to the plane. Drop a perpendicular from E to the plane 


and construct the triangle REQ shown in Figure 1.3.7. The distance d = EQ] is the 
length of the projection of v= RE (the vector from R to E) onto n; thus, 


Distance = |v- n| = |[(x1 — xo)i+ ( yı — yo)j + (zı — zo) K] - nj 
_ |A(x1 — xo) + Blyi — yo) + C(z1 — zo) | 


VA? + B24 C2 
If the planeis given inthe form Ax + By + Cz + D = 0, then for any point (xo, yo, zo) 
on it, D = —( Axo + Byo + Czo). Substitution into the previous formula gives the 


following: 


Distance from a Point to a Plane The distance from (x1, yı, zı) to the 
plane Ax + By +Cz+D=0is 


|Ax1 + By: + Czi + D| 
A?+B24+C2 


Distance = 


< = (x1, Yi z1) 


figure 1.3.7 The geometry for determining the 
distance from the point E to plane P. 


R= (xo Yo zo) 


~, 
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example 12 | Find the distance from Q = (2,0, —1) to the plane 3x — 2y + 8g + 1 = 0. 


solution | We substitute into the formula in the preceding box the values xı =2y; = 0, zı = —1 
(the point) and A = 3, B = —2, C = 8, D = 1 (the plane) to give 


Distance = |3-2+(—2)-0+8(-1)+1/) |- _ 1 
7 V32 + (=2)? + ~ JTT i JTT i 


The Origins of the Vector, Scalar, Dot, and Cross Products 


QUADRATIC EQUATIONS, CUBIC EQUATIONS, AND IMAGINARY NUMBERS. We 
know from Babylonian clay tablets that this great civilization possessed the 
quadratic formula, enabling them (in verbal form) to solve quadratic equations. 
Because the concept of negative numbers had to wait until the sixteenth century 
to see the light of day, the Babylonians did not consider either negative 

(or imaginary) solutions. 

With the Renaissance and the rediscovery of ancient learning, Italian 
mathematicians began to wonder about the solutions of cubic equations, 
X + ax? + bx+c=0, where a, b, and care positive numbers. 

Around 1500, Scipione del Ferro, a professor in Bologna (the oldest European 
university), was able to solve cubics of the form xX + bx = c, but kept his discovery 
secret. Before his death, he passed his formula to his successor, Antonio Fior, who 
for a while also kept the formula to himself. It remained a secret until a brilliant, 
self-taught mathematician named Nicolo Fontana, also known as Tartaglia (the 
stammerer), appeared on the scene. Tartaglia claimed he could solve the cubic, 
and Fior felt he needed to protect the priority of del Ferro, and so in response 
challenged Tartaglia to a public competition. 

We are told that Tartaglia was able to solve all of the thirty cubic equations 
posed by Fior. Amazingly, some scholars believe that Tartaglia discovered the 
formula for solutions to x° + cx = d only days before the contest was to take 
place. 

The greatest mathematician of the sixteenth century, Gerolamo 
Cardano (1501-—1576)—a Renaissance scholar, mathematician, 
physician, and fortuneteller—gave the first published solution of the general 
cubic. Although born of modest means, he (like Tartaglia) rose, through 
effort and natural brilliance, to great fame. Cardano is the author of 
the first book on games of chance (marking the beginning of modern 
probability theory) and also of Ars Magna (the Great Art), which marks 
the beginning of modern algebra. It was in this book that Cardano 
published the solution to the general cubic xX + ax? + bx+c=0. 

How did he get it? 

While working on his algebra book, and aware that Tartaglia was able to 
solve forms of cubic, Cardano, in 1539, wrote to Tartaglia asking for a meeting. 
After some cajoling, Tartaglia agreed. It was at this meeting that, in exchange for 
a pledge of secrecy (and we know how these generally go), Tartaglia revealed 
his solution, from which Cardano was able to derive a solution to the general 
equation, which then appeared in Ars Magna. Feeling betrayed, Tartaglia led a 
scathing attack on Cardano, leading to a small soap opera. 

What is important for us, at the moment, is that as a consequence of the 
method of solution, something very strange occurred. Consider the cubic 
X — 15x = 4. Its only positive root is 4. However, the Tartaglia—Cardano solution 


Historical Note 
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formula yields 


x= 4/24/1214 ey @) 


as the positive root. Thus, this number must be equal to 4. Yet this must be 
nonsense, because inside the cube root we are taking the square root of a 
Over 100 years later, in 1702, when Leibniz, codiscoverer of calculus, showed the 
great Dutch scientist Christian Huygens the formula 


Vb=V14+V3+y1-V3 (4) 


Huygens was completely flabbergasted, and remarked that this equality “defies 
all human understanding.” (Try, informally, to verify both formulas (1) and (2) for 
yourself.) 

Whether nonsense or not, Tartaglia and Cardano's formula forced 
mathematicians to confront square roots of negative numbers (or imaginary 
numbers, as they are called today). 


THE MATURING OF COMPLEX NUMBERS. For well over two centuries, numbers 
like i = \/—1 were looked at with great suspicion. The square root of any negative 
number can be written in terms of i; for example, /—a = ,/(a)(— 1) = Ja/-1 : 
In the middle of the eighteenth century, the Swiss mathematician Leonhard Euler 
connected the universal cosmic numbers eand z with the imaginary number i. 
Whatever i was or meant, it necessarily follows that 


e =-1, 


that is, e“raised to the power zi equals —1.” Thus, these cosmic numbers, 
reflecting perhaps some deeper mystery, are in fact connected to each other by 
a very simple formula. 

At the beginning of the nineteenth century, the German mathematician Karl 
Friedrich Gauss was able to prove the fundamental theorem of algebra, which 
says that any nth-degree polynomial has nroots (some or all of which may be 
imaginary; that is, the roots have the form a + bi, where, as earlier, i = “f= and 
where a and bare real numbers). 

By the middle of the nineteenth century, the French mathematician 
Augustin-Louis Cauchy and the German mathematician Bernhard Riemann had 
developed the differential calculus for functions of one complex variable. An 
example of such a function is F(z) = z”, where z= a + bi. In this case, the usual 
formula for the derivative, F '(z)=nz™", still holds. However, by introducing 
imaginary numbers, Cauchy was able fo evaluate “real integrals” that 
heretofore could not be evaluated. For example, it is possible to show that 


oo ai 

| SINX 
| — dx= 
iy. A 


f log sin x dx = —z log2. 
0 


N/a 


and that 


These were stunning results. 
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figure 1.3.8 Sir William Rowan 
Hamilton (1805-1865). 


In summary, the solution of the cubic equation, the fundamental theorem of 
algebra, and the evaluation of real integrals proved how valuable it was to 
consider imaginary numbers a+ bi, even though they were not (at least not yet) 
on ferra firma. Did they really exist or were they simply phantoms of our 
imagination, and thus truly imaginary? 


HAMILTON'S DEFINITION OF COMPLEX NUMBERS. Many mathematicians after 
Cardano made important contributions to imaginary (or complex) numbers, 
including Argand, Wessel, and Gauss—all of whom represented them 
geometrically. However, the modern, intellectually rigorous definition of a 
complex number is due to the great Irish mathematician William Rowan Hamilton 
(see Figure 1.3.8). After Newton, who created the vector concept through his 
invention of the notion of force, Hamilton was, beyond any doubt, the most 
important and singular figure in the development of vector calculus. It was 
Hamilton who gave us the terms vector and scalar quantity. 

William Rowan Hamilton was born in Dublin, Ireland, at midnight on August 3, 
1805. In 1823, he entered Trinity College, Dublin. His university career, by any 
standard, was phenomenal. By his third year, Trinity offered him a professorship, 
the Andrew's Chair of Astronomy, and the State named him Royal Astronomer of 
Ireland. These honors were based on his theoretical prediction (in 1824) of two 
entirely new and unexpected optical phenomena, namely, internal and external 
conical refraction. 

By 1827 he had become interested in imaginary numbers. He wrote that “the 
symbol gi is absurd, and denotes an impossible extraction...” He set out to 
put the idea of a complex number on a firm logical foundation. His solution was 
to define a complex number a + bi as a point (a b) in the plane R?, much as we 
do today. Thus, the imaginary number bi for Hamilton was simply the point (0, B 
on the yaxis. The difference between complex numbers and the Cartesian plane 
was that Hamilton followed the proforma multiplication of complex numbers: 


(a+ bi)(c+ di) = (ac— bd) + (ad+ be)i, 
and defined a new multiplication on the complex plane: 


(a, b) - (c, d) = (ac— bd, ad+ be). 


Thus, i = ,/—1 just disappears into the point (0, 1), and the mystery and 
confusion over complex numbers disappears along with it. 


FROM COMPLEX NUMBERS TO QUATERNIONS. From Hamilton's interpretation, 
complex numbers are nothing more than the extension of real numbers into a 
new dimension, two dimensions. Hamilton, however, also did fundamental work 
in mechanics, and he knew well that two dimensions were too limiting for the 
space analysis necessary for understanding the physics of the three-dimensional 
world. Therefore, Hamilton set out to find a triplet system; that is, an acceptable! 
multiplication scheme on points (a, b, c) in R, or, as it were, on vectors 

ai+ bj + ck. 


'For him, “acceptable” meant that the associative law of multiplication would hold. 
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By 1843, Hamilton realized that his quest was hopeless. But then, on October 
16, 1843, Hamilton discovered that what he could not achieve for R? he could 
achieve for R*; he discovered quaternions, an entirely new number system. 

Hamilton? had realized that the multiplication he had been searching for 
could be introduced on 4-tuples (a, b, c, d), which he had denoted by 


a+ bi + cj + dk. 


The awas called the scalar part and bi + cj + dk was called the vector part, 
which in reality, as with complex numbers, meant the point (a, b, c, d) in R*. The 
multiplication table he introduced was 


ij =k=-ji 
ki = j = —ik 
jk =i = —kj 

?P =p =k =ijk=-1. 


Hamilton continued to passionately believe in his quaternions until the end of 
his life. Unfortunately, historical development went in another direction. 

The first step away from the quaternions was in fact taken by a 
firm believer in the importance of quaternions, namely, Peter Guthrie Tait, who 
was born in 1831 near Edinburgh, Scotland. In 1860, Tait was appointed to the 
Chair of Natural Philosophy at Edinburgh University, where he remained until his 
death in 1901. In 1867, he wrote his Elementary Treatises on Quaternions, a text 
stressing physical applications. His third chapter was most significant. It was here 
that Tait looked at the quaternionic product of two vectors: 


v=ai + bj+ ck and w= di+bj+ck. 


Then the product vw, as defined by Hamilton, yields: 


(ai + bj + ck)(a'i+ bj + c’k) 
= —(aa' + bb’ + cc’) + (be — cb’)i + (ac’ — ca’)j + (ab’ — ba’)k 


or, in modern form: 
vw = —(V-W)4+Vxw, 


where - is the modern dot or inner product of vectors and x is the cross product. 
Tait discovered the formulas 


V-w = |ivilw]|cos? and — |v x wij = |vilw]| sin 8, 
where @ is the angle formed by v and w. Moreover, he showed that v x w was 


orthogonal to v and w, therefore giving a geometric interpretation of the 
quaternionic product of two vectors. 


2North British Review, 14(1858), p. 57. 
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This began the move away from the study of quaternions and back to 
Newton's vectors, with the quaternionic product eventually being replaced by 
two separate products, the inner product and the cross product. 

By the way, you might wonder why Hamilton did not at first discover the cross 
product, since it is a product on R°. The reason is that it did not have a 
fundamental property that he required—namely, it was not associative: 


0 = (i x i) x k £i x (i x k) = —k. 


Remarkably, Euler discovered the cross product in component form in 1750, 
and three years before Hamilton, Olinde Rodrigues also discovered a form of 
quartenionic multiplication. 


THE MOVE AWAY FROM QUATERNIONS. The scientists ultimately 

responsible for the demise of quaternions were James Clerk Maxwell 

(see Figure 1.3.9), Oliver Heaviside, and Josiah Willard Gibbs, a founder of 
statistical mechanics. In the 1860s, Maxwell wrote down his monumental 
equations of electricity and magnetism. No vector notation was used (it did not 
exist). Instead, Maxwell wrote out his equations in what we would now call 
“component form.” Around 1870, Tait began to correspond with Maxwell, 
piquing his interest in quaternions. 

In 1873, Maxwell published his epic work, Treatise on Electricity and 
Magnetism. Here (as we shall do in Chapter 8), Maxwell wrote down the 
equations of the electromagnetic field using quaternions, thus motivating 
physicists and mathematicians alike to take a closer look at them. From this 
manuscript many have concluded that Maxwell was a supporter of the 
“quaternionic approach” to physics. The truth, however, is that Maxwell was 
reluctant to use quaternions. It was Maxwell, in fact, who began the process of 
separating the vector part of a product of two quaternions (the cross product) 
from its scalar part (the dot product). 

It is known that Maxwell was troubled by the fact that the scalar part of the 
“square” of a vector (vv) was always negative (—v - v), which in the case of a 
velocity vector could be interpreted as negative kinetic energy—an 
unacceptable idea! 

It was Heaviside and Gibbs who made the final push away from quaternions. 
Heaviside, an independent researcher interested in electricity and magnetism, 
and Gibbs, a professor of mathematical physics at Yale, almost simultaneously— 
and independently—created our modern system of vector analysis, which we 
have just started to study. 


3Interestingly, if one is willing to continue to live with nonassociativity, there is also a vector product 


with most of the properties of the cross product in R7; this involves yet another number system called the 
octonians, which exists in R8. The nonexistence of across product in other dimensions is a result that 
goes beyond the scope of this text. For further information, see the American Mathematical Monthly, 74 
(1967), pp. 188-194, and 90(1983), p. 697, as well as}. Baez, “TheOctonians,” Bulletin of the American 
Mathematical Society, 39 (2002), pp. 145-206. One can show that systems like the quaternions and 
octonians occur only in dimension 1 (the reals R), dimension 2 (the complex numbers), dimension 4 
(the quaternions), and dimension 8 (the octonians). On the other hand, the “right” way to extend the 
cross product is to introduce the notion of differential forms, which exists in any dimension. We discuss 


their construction in Section 8.5. 


1.3 Matrices, Determinants, and the Cross Product 49 


In 1879, Gibbs taught a course at Yale in vector analysis with applications to 
electricity and magnetism. This treatise was clearly motivated by the advent of 
Maxwell's equations, which we will be studying in Chapter 8. In 1884, he 
published his Elements of Vector Analysis, a book in which all the properties of 
the dot and cross products are fully developed. Knowing that much of what 
Gibbs wrote was in fact due to Tait, Gibbs's contemporaries did not view his book 
as highly original. However, it is one of the sources from which modern vector 
analysis has come into existence. 

Heaviside was also largely motivated by Maxwell's brilliant work. His great 
Electromagnetic Theory was published in three volumes. Volume | (1893) 
contained the first extensive treatment of modern vector analysis. 

We all owe a great debt to E. B. Wilson's 1901 book Vector Analysis: A Textbook 
for the Use of Students of Mathematics and Physics Founded upon the Lectures of 
figure 1.3.9 James Clerk J. Willard Gibbs. Wilson was reluctant to take Gibbs's course, because he had just 
Maxwell (1831-1879). completed a full-year course in quaternions at Harvard under J. M. Pierce, a 

champion of quaternionic methods; but he was forced by a dean to add the 
course to his program, and he did so in 1899. Wilson was later asked by the editor 
of the Yale Bicentennial Series to write a book based on Gibbs's lectures. For a 
picture of Gibbs and for additional historical comments on divergence and curl, 
see the Historical Note in Section 4.4. 


exercises 


L Verify that interchanging the first two rows of the 3 x 3 3. Compute ax b wherea= i- 2j + k, b= 2i+j+k 
determinant 


1 2 1 4. Compute a- (bx ©), where aand bare as in Exercise 3 
3 0 1 and c= 3i-j + 2k 
2 0 2 
; - 5. Find the area of the parallelogram with sides aand b 
changes the sign of the determinant. Jiven in Crane. 
= EA a 6. A triangle has vertices (0, 0, 0), (1, 1, 1), and (0, —2, 3). 
(a) |2 -1 0 (b) 136 18 17 Find its area. 
4 32 45 24 2 l o 
3 0 1 5 e 7. What is the volume of the parallelepiped with sides 
2i+j—k 5i- 3k andi-2j+k 
“ r a w : Be 8. What is the volume of the parallelepiped with sides 
9 16 25 17 19 23 i, 3j — k, and 4i + 2j — K? 


In Exercises 9 to 12, describe all unit vectors orthogonal to both of the given vectors. 


9. ij 13. Compute u+ v, u- v, |jull, Ivi], and u x v, where 
u=i- 2j+k v=2i-j+2k. 
10. —5i + 9j — 4k, 7i + 8j + 9k 
14. Repeat Exercise 13 for 
IL —5i+ 9j-— 4k, 7i + 8j + 9k u= 3i+j—k v= —6i— 2j — 2k 


12 2i- 4j + 3k, —4i+ 8j — 6k 


The Geometry of Euclidean Space 


15. Find an equation for the plane that 
(a) is perpendicular to v= (1, 1, 1) and passes through 
(1, 0, 0). 


(b) is perpendicular to v= (1, 2, 3) and passes through 
(1,1, 1). 


(c) is perpendicular to the line 
Kr) = (5, 0, 2) + (3, —1, 1) and passes through 
(5, —1, 0). 


(d) is perpendicular to the line 
Kr) = (—1, —2, 3)t + (0, 7, 1) and passes through 
(2,4, —1). 


16. Find an equation for the plane that passes through 
(a) (0, 0, 0), (2, 0, —1), and (0, 4, —3). 
(b) (1,2, 0), (0,1, —2), and (4, 0, 1). 
(c) (2, —1, 3), (0, 0, 5), and (5, 7, —1). 


17. Show that the points (0, —2, —1), (1, 4, 0), (2, 10, 1) do 
not determine a unique plane. 


18. Let P be the plane defined by the equation 
x + y +z = 1. Which of the following points are 
contained in P? 
(a) (0, 0, 0) 
(b) (1,1, —1) 
(c) (—3, 8, —4) 
(d) (1, 2, —3) 


19. (a) Show that two parallel planes are either identical or 
they never intersect. 


(b) How do two nonparallel planes intersect? 


20. Find the intersection of the planes x + 2y + z = 0 and 
x—3y-z=0. 


2L Find the intersection of the planes x +(y—1)+z=0 
and —x + (y+ 1)—-z=0. 


22. Find the intersection of the two planes with equations 
3(x — 1) + 2y + (z + 1) = 0 and 
(x —1)+4y —(z+1) =0. 
23. (a) Prove the two triple-vector-product identities 
(ax b) x c= (a- gb - (b. ġa 


and 


ax (bx œo = (a. gġb- (a. bc. 


(b) Prove (u x v) x w= u x (v x w) if and only if 
(ux W x v=0 

(c) Also prove that 
(ux Vv) x w+ (vxw x u+ (Wx u x v=0 
(called the J acobi identity). 


24. (a) Prove, without recourse to geometry, that 
u. (Vx W = v. (wx u) = w. (u x v) = —u. (Wx Vv) 
= —w:(vx u) = —v- (u x W. 
(b) Use part (a) and Exercise 23(a) to prove that 
(ux v- (u xv) = (u-u)(v- v) — (u-v)(u-v) 


=s u-v 
—ju-v wv] 


25. Verify Cramer's rule. 


26. What is the geometric relation between the vectors vand 
wif |v x wi] = 5 IM Iwi]? 


27. Letv= (1, 1, 0) and w= (0, 2, —1). U se the algebraic 
rules and multiplication table on page 37 to compute 
v x wwithout using determinants. 


28. Find an equation for the plane that passes through the 
point (2, —1, 3) and is perpendicular to the line 
v= (1, —2, 2) + t(3, —2, 4). 


29. Find an equation for the plane that passes through the 
point (1, 2, —3) and is perpendicular to the line 
v= (0, —2, 1) + ¢(1, —2, 3). 

30. Find the equation of the line that passes through the 
point (1, —2, —3) and is perpendicular to the plane 
3x —y—2z2+4=0. 


3L Find an equation for the plane containing the two 
(parallel) lines 


vı = (0, 1, —2) + 2(2, 3, -1) 
and 
v = (2, —1, 0) + 2(2, 3, —1). 


32. Find a parametrization for the line perpendicular to 
(2, —1, 1), parallel to the plane 2x + y — 4z = 1, and 
passing through the point (1, 0, —3) 


33. Find an equation for the plane containing the point 
(1, 0, 1) and the line Ir) = (1, 2, —1) + ¢(1, 0, 5). 


34. Find the distance from the point (2, 1, —1) to the plane 
x —2y+2z4+5=0. 


35. Find an equation for the plane that contains the line 
v= (—1, 1, 2) + t(3, 2, 4) and is perpendicular to the 
plane 2x + y — 3z +4 = 0. 


36. Find an equation for the plane that passes through 
(3, 2, —1) and (1, —1, 2) and that is parallel to the line 
v= (1, —1, 0) + z(3, 2, —2). 


37. Redo Exercises 25 and 26 of Section 1.1 using the dot 
product and what you know about normals to planes. 


38. Given vectors aand b, do the equations x x a= band 
X- a= |lal| determine a unique vector x? Argue both 
geometrically and analytically. 


39. Determine the distance from the plane 
12x + 13y + 5z + 2 = 0 to the point (1, 1, —5). 


40. Find the distance to the point (6, 1, 0) from the plane 
through the origin that is perpendicular to i— 2j + k. 


4L (a) In mechanics, the moment M of a force F about a 
point O is defined to be the magnitude of F times 
the perpendicular distance d from O to the line of 
action of F. The vector moment M is the vector of 


magnitude M whose direction is perpendicular to the 


plane of O and F, determined by the right-hand rule. 
Show that M = R x F, where R is any vector from 
O to the line of action of F. (See Figure 1.3.10.) 


N 


ine of action 


figure 1.3.10 Moment of a 
force. 


(b) Find the moment of the force vector 
F =i — j + 2knewtons about the origin if the line 
of actionisx =1 +t, y=1-t,z=2t. 


42. Show that the plane that passes through the three points 
A = (aj, m, a3), B = (b, b2, b3), and C = (c1, c2, c3) 
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consists of the points P = (x, y, z) given by 


) Zz 
biy—x bo-—y b3-—z|=0. 
Pa 


(HINT: Write the determinant as a triple product.) 


43. Two media with indices of refraction nı and nz are 
separated by a plane surface perpendicular to the unit 
vector N. Let aand bbe unit vectors along the incident 
and refracted rays, respectively, their directions being 
those of the light rays. Show that n1(N x a) = n2(N x b) 
by using Snell’s law, sin, / sin 62 = n2/n1, where 61 
and 62 are the angles of incidence and refraction, 
respectively. (See Figure 1.3.11.) 


N Light ray 


figure 1.3.11 Snell's law. 


44. Justify the steps in the following computation: 


12 3 1 2 3 1 2 3 

45 6=l0 -3 -6 =l -3 -6 

7810 7 8 1 lo -6 -1 
e | 33—36 = —3 
=| -117 = 


45. Show that adding a multiple of the first row of a matrix 
to the second row leaves the determinant unchanged; 
that is, 


a by Cl a b ci 
a? + ìa] bz +Ab, c +ìcı| =a? b2 cl. 
a3 b3 c3 a3 b3 c3 


[In fact, adding a multiple of any row (column) of a 
matrix to another row (column) leaves the determinant 
unchanged.] 


46. Suppose v, we R? are orthogonal unit vectors. Let 


u= Vx w Show that w= ux vand v= wx u 
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Historical Note 


A standard way to represent a point in the plane R? is by means of rectangular coordinates 
(x, y). However, as you have probably learned in elementary calculus, polar coordinates 
in the plane can be extremely useful. As portrayed in Figure 1.4.1, the coordinates (r, @) 
are related to (x, y) by the formulas 


x =rcosé and y=rsing, 


where we usually taker > 0 and0 < 0 < 2x. 


(x, y) 


\ figure 1.4.1 The polar coordinates of (x,y) are (r, 6). 
0 


If you are not familiar with ploar coordinates, we advise you to study the relevant 
section of their calculus texts. We now set forth two ways of representing points in 
space other than by using rectangular Cartesian coordinates (x, y, z). These alternative 
coordinate systems are particularly well suited for certain types of problems, such as 
the evaluation of integrals using a change of variables. 


In 1671, Isaac Newton wrote a manuscript entitled The Method of Fuxions and 
Infinite Series, which contains many uses of coordinate geometry to sketch the 
solutions of equations. In particular, he introduces the polar coordinate system, 
among various other coordinate systems. 

In 1691, Jacob Bernoulli published a paper also containing polar coordinates. 
Because Newton's manuscript was not published until after his death in 1727, 
credit for the discovery of polar coordinates is usually attributed to Bernoulli. 


Cylindrical Coordinates 


Definition The cylindrical coordinates (r, 6, z) of a point (x, y, z) are defined 
by (see Figure 1.4.2) 


x =rcosé, y=rsiné, B= %& (1) 


example | 


solution 
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>» N 


(x,y,z) 


figure 1.4.2 Representing a point (x, y, 2 in terms 
Z of its cylindrical coordinates r,@, and z. 


figure 1.4.3 The graph of the points whose cylindrical 
coordinates satisfy r = a is a cylinder. 


To express r, 6, and z in terms of x, y, and z, and to ensure that 6 lies between 0 and 
27, we can write 


tan-1( y/x) ifx >Oandy>0 


r= /x?+y?, 6=<¢n+4+tan“( y/x) ifx <0 Z=, 
2x + tan-*( y/x) ifx > Oand y <0, 


where tan-?( y/x) is taken to lie between —z/2 and 2/2. The requirement that 0 < 6 < 
2x uniquely determines 6 and r > 0 for a given x and y. If x = 0, then 8 = x/2 for 
y > Oand 37/2 for y < 0. If x = y = 0, 8 is undefined. 

In other words, for any point (x, y, z), we represent the first and second coordinates 
in terms of polar coordinates and leave the third coordinate unchanged. Formula (1) 
shows that, given (r, 6, z), the triple (x, y, z) is completely determined, and vice versa, 
if we restrict 6 to the interval [0, 27) (sometimes the range (—z, x ] is convenient) and 
require that > 0. 

To see why we use the term cylindrical coordinates, note that if the conditions 
0 < 0<2n,-co <z < œ hold and if r = a is some positive constant, then the 
locus of these points is a cylinder of radius a (see Figure 1.4.3). 


(a) Find and plot the cylindrical coordinates of (6, 6, 8). (b) If a point has cylindrical 
coordinates (8, 27/3, —3), what are its Cartesian coordinates? Plot. 


For part (a), we have r = /6? + 62 = 6,/2 and 6 = tan-+(6/6) = tan-(1) = 7/4. 
Thus, the cylindrical coordinates are (6,/2, 7/4, 8). This is point P in Figure 1.4.4. 
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O(-4, 4V3, -3) 


figure 1.4.4 Some examples of the conversion between 
Cartesian and cylindrical coordinates. 


For part (b), note that 27/3 = 2/2 + 2/6 and compute 


2 
x= rosé = 800s = = 4 
and 
y=rsing =8sin Z = 8 = ay3. 


Thus, the Cartesian coordinates are (—4, 4/3, —3). This is point Q in the figure. A 


Spherical Coordinates 


Cylindrical coordinates are not the only possible generalization of polar coordinates to 
three dimensions. Recall that in two dimensions the magnitude of the vector xi + yj 
(that is, \/x? + y?) is ther in the polar coordinate system. For cylindrical coordinates, 
the length of the vector xi+ yj + zk, namely, 


p=yx? +y +z, 


is not one of the coordinates of that system— instead, we used the magnitude r = 
\/x? + y?, the angle @, and the “height” z. 

We now modify this by introducing the spherical coordinate system, which does 
use p as a coordinate. Spherical coordinates are often useful for problems that possess 
spherical symmetry (symmetry about a point), whereas cylindrical coordinates can be 
applied when cylindrical symmetry (symmetry about a line) is involved. 
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(x,y,z) 


ow z figure 1.4.5 Spherical coordinates (p, 6, $); 
the graph of points satisfying p = a is a 
WA sphere. 


Given a point (x, y, z) € R?, let 
p=Vlertry +e 
and represent x and y by polar coordinates in the xy plane: 
x =rcosd, y=rsing, (2) 


where r = \/x*+ y? and @ is determined by formula (1) [see the expression for @ 
following formula (1)]. The coordinate z is given by 


z = pCos¢, 
where ¢ is the angle (chosen to lie between 0 and x, inclusive) that the radius vector 
v = xi+ yj + zk makes with the positive z axis, in the plane containing the vec- 


tor v and the z axis (see Figure 1.4.5). Using the dot product, we can express @ as 
follows: 


v-k v-k 
coso = —, that is, = cos} (=). 
°= Wi # vi 


We take as our coordinates the quantities p, 0, @. Because 
r=psing, 


we can use formula (2) to find x, y, z in terms of the spherical coordinates p, 0, œ. 


Definition The spherical coordinates of points (x, y, z) in space are the triples 
(o, 8, @), defined as follows: 

x = psingcoss, y= psingsindg, z= pCcos®, (3) 
where 


p=), O< @ =< De, O<O< we 
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In 1773, Joseph Louis Lagrange was working on Newton's gravitational theory as 
it applied to ellipsoids of revolution. In attempting to calculate the total 
i : gravitational attraction of such an ellipsoid, he encountered an integral that was 
H istorical Note difficult to evaluate. Motivated by this application, he introduced spherical 
coordinates, which allowed him to calculate the integral. We will be discussing 
the method of changing coordinates as it applies to multiple integrals in Section 
6.2, and applications to gravitation in Section 6.3, where we show how the inverse 
square law of gravity allowed Newton to consider spherical masses as point 
masses. 
Spherical coordinates are also closely connected to navigation by latitude 
and longitude. To see the connection, first note that the sphere of radius a 
centered at the origin is described by a very simple equation in spherical 
coordinates, namely, p = a. Fixing the radius a the spherical coordinates 6 and ¢ 
are similar to the geographic coordinates of longitude and latitude if we take the 
earth's axis to be the z axis. There are differences, though: The geographical 
longitude is |6| and is called east or west longitude, according to whether 8 is a 
positive or negative measure from the Greenwich meridian; the geographical 
latitude is | /2 — | and is called north or south latitude, according to whether 
1/2—¢ is positive or negative. 


example 2 (a) Find the spherical coordinates of the Cartesian point (1, —1, 1) and plot. 


(b) Find the Cartesian coordinates of the spherical coordinate point (3, 7/6, 7/4) and 
plot. 


(c) Leta point have Cartesian coordinates (2, —3, 6). Find its spherical coordinates 
and plot. 


(d) Leta point have spherical coordinates (1, —x /2, 1/4). Find its Cartesian 
coordinates and plot. 


solution (a) p= +y +z =V +O +P = V3, 


—1 7 
0 = 2x +tan (2) =2r +e ( Z) = 2r — F = z 
x 


Zz 1 
=cos!(=)= cos( ~ 0.955 ~ 54.74°. 
(5) V3 


See Figure 1.4.6(a) and the formula for 6 following formula (1). 


(b) x = psingcosé = 3sin( 7) cos( =) =3/ 1 o 3/3 


4 6 I T 
y= psingsing =3sin(7) sin(=) =3(=) (5) = y 
T 3v2 


3 
z= pcosġ =3cos(7) = a 
See Figure 1.4.6(b). 
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(a) z 


(,-1,1) 2= 43 
= 55° 


figure 1.4.6 Finding (a) the spherical coordinates of the point (1,-1, 1), 
and (b) the Cartesian coordinates of (3, 2/6, 7/4). 


(c) p= yx? +y? +z’ = J? +(-3)7 +0 = V49 =7, 


b= te. tan-1( =) =2n + tan- (=) ~ 5.3004 radians ~ 303.69°, 


$ = cos! (<) = cos} (5) ~ 0.541 ~ 31.0°. 
p 7 
See Figure 1.4.7(a). 


(d) x = psing cos6 = 1sin( =) cos( z) = (F) o=o 


2 
y = psingsing =1sin(=) sin( =) = (+) 1) oe 


T 


z = pcosġ = 1cos( =) = 


See Figure 1.4.7(b). 


figure 1.4.7 Finding (a) the spherical coordinates of the 
point (2, —3, 6), and (b) the Cartesian coordinates of 
(,-2/2,7/4). A 
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figure 1.4.8 (a) Orthonormal 
vectors e,, €, and e; associated 
with cylindrical coordinates. The 
vector e, is parallel to the line 
labeled r. (b) Orthonormal 
vectors €,,, @,, and e, associated 
with spherical coordinates. 


example 3 


solution 


(a) 


x 


Express (a) the surface xz = 1 and (b) the surface x? + y? — z? =1 in spherical 
coordinates. 


From formula (3), x = psingcosé, and z = pcos@, and so the surface xz = 1 in 
(a) consists of all (o, 8, @) such that 


p’sindcosécos@=1,  thatis,  p*sin2¢cos@ = 2. 
For part (b), we can write 
xy dz S= Hy Haz nz = p* — 29? cos’, 
so that the surface is oọ?°(1 — 2 cos? @) = 1; that is, —p? cos (2¢) = 1. A 


Associated with cylindrical and spherical coordinates are unit vectors that are the 
counterparts of i, j, and kfor rectangular coordinates. They are shown in Figure 1.4.8. 
For example, e. is the unit vector parallel to the xy plane and in the radial direction, so 
that e = (cos@)i + (sin@)j. Similarly, in spherical coordinates, e, is the unit vector 
tangent to the curve parametrized by the variable @ with the variables o and @ held fixed. 
We shall use these unit vectors later when we use cylindrical and spherical coordinates 
in vector calculations. 


exercises 


L Find the spherical coordinates of the Cartesian point (—2,/3, —2, 3). (Only the first point is solved in the 


(v2, —¥6, -2,/2). 


2. Find the spherical coordinates of the Cartesian point 


Study Guide.) 


4. Describe the geometric meaning of the following 
mappings in cylindrical coordinates: 


(v6, —/2, -2,/2). 
(a) (7,6, z) => (r, 0, =z) 
3. (a) The following points are given in cylindrical (b) (7, 6, z) = (7,6 +x, =z) 
coordinates; express each in rectangular coordinates and (c) (r, 0, z) > (=r, 0 — 2/4, z) 
spherical coordinates: (1, 45°, 1), (2, 2/2, —4), 
(0, 45°, 10), (3, x/6, 4), (1, 2/6, 0), and (2, 32/4, —2). 5. Describe the geometric meaning of the following 
(Only the first point is solved in the Study Guide.) mappings in spherical coordinates: 
(b) Change each of the following points from (a) (p, 0, ¢) = (0,6 +7, 6) 
rectangular coordinates to spherical coordinates and to (b) (p,0,¢) => (p, 6,2 —¢) 


cylindrical coordinates: (2, 1, —2), (0, 3, 4), (V2, 1, 1), (c) (p,0, 6) + (20, 0 + 2/2, ¢) 


@ Sketch the following solids: 
(a) r € [0,1], 6 € [0, x], z € [-1, 1] 
(b) r €e [0,2], 6 €[0, x/2], z €e [0,4] 
(c) p €e [0,1], 6 e [0,27],  € [0, 7/4] 
(d) o €e [1,2], 6 e [0,27], e [0, x/2] 


7. Sketch the following surfaces: 


(a) z=r? 

(b) o = 4csc o seco 
(c) r=4sing 

(d) osing =2 


8. (a) Describe the surfaces r = constant, 6 = constant, 
and z = constant in the cylindrical coordinate system. 


(b) Describe the surfaces » = constant, 6 = constant, 
and @ = constant in the spherical coordinate system. 


9. Show that to represent each point in IR? by spherical 
coordinates, it is necessary to take only values of @ 
between 0 and 2x, values of @ between 0 and x, and 
values of p > 0. Are coordinates unique if we allow 
p <0? 


10. Describe the following solids using inequalities. State 
the coordinate system used. 


(a) A cylindrical shell 8 units long, with inside 
diameter 2 units and outside diameter 3 units 


(b) A spherical shell with inside radius 4 units and 
outside radius 6 units 

(c) A hemisphere of diameter 5 units 

(d) A cube of side length 2 


IL Let S be the sphere of radius R centered at the origin. 
Find the equation for S in cylindrical coordinates. 


I2. Using cylindrical coordinates and the orthonormal 
(orthogonal normalized) vectors e, @, and e (see 
Figure 1.4.8), 


(a) express each of e, @,, and e in terms of i, j, kand 
(x, y, z); and 


(b) calculate @ x j both analytically, using part (a), and 


geometrically. 


13. Using spherical coordinates and the orthonormal 
(orthogonal normalized) vectors e,, @), and ey [see 
Figure 1.4.8(b)], 


(a) express each of e@,, @, and & in terms of i, j, K and 


(x, y, z); and 


14. 


15. 


17. 


19, 


20. 
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(b) calculate œ x j and @ x j both analytically and 
geometrically. 


Express the plane z = x in (a) cylindrical, and (b) 
spherical coordinates. 


Show that in spherical coordinates: 


(a) pis the length of xi + yj + zk 
(b) ġ =cos7!(v-k/||viJ), where v = xi + yj + zk 
(c) 6 =cos~!(u-i/|jull), where u= xi + yj. 


Two surfaces are described in spherical coordinates by 
the two equations p = f (6, ) and 

p =—2f(0, ), where f(0, d) is a function of two 
variables. How is the second surface obtained 
geometrically from the first? 


A circular membrane in space lies over the region 

x? + y? < a2. The maximum z component of points in 
the membrane is b. Assume that (x, y, z) is a point on 
the membrane. Show that the corresponding point 

(r, 0, z) in cylindrical coordinates satisfies the 
conditions 0 < r <a,0 < 0 < 2x, |z| < b. 


A tank in the shape of a right-circular cylinder of radius 
10 ft and height 16 ft is half filled and lying on its side. 
Describe the air space inside the tank by suitably chosen 
cylindrical coordinates. 


A vibrometer is to be designed that withstands the 
heating effects of its spherical enclosure of diameter d, 
which is buried to a depth d/3 in the earth, the upper 
portion being heated by the sun (assume the surface is 
flat). Heat conduction analysis requires a description of 
the buried portion of the enclosure in spherical 
coordinates. Find it. 


An oil filter cartridge is a porous right-circular cylinder 
inside which oil diffuses from the axis to the outer 
curved surface. Describe the cartridge in cylindrical 
coordinates, if the diameter of the filter is 4.5 inches, the 
height is 5.6 inches, and the center of the cartridge is 
drilled (all the way through) from the top to admit a 


3-inch-diameter bolt. 


Describe the surface given in spherical coordinates by 
p = 0820. 


(a) Find all points p € R? that have the same 
representation in both Cartesian and spherical 
coordinates. 


(b) Find all points p € R? that have the same 
representation in both Cartesian and cylindrical 
coordinates. 
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1.5 n-Dimensional Euclidean Space 


Vectors in n-space 


In Sections 1.1 and 1.2 we studied the spaces R = Rt, R?, and R? and gave geometric 
interpretations to them. For example, a point (x, y, z) in R? can be thought of as a 
geometric object, namely, the directed line segment or vector emanating from the origin 
and ending at the point (x, y, z). We can therefore think of IR? in either of two ways: 


(i) Algebraically, as a set of triples (x, y, z), where x, y, and z are real numbers 


(ii) Geometrically, as a set of directed line segments 


These two ways of looking at IR? are equivalent. For generalization it is easier to 
use definition (i). Specifically, we can define R”, where n is a positive integer (possibly 
greater than 3), to be the set of all ordered n-tuples (x1, x2,..., Xn), where the x; are 
real numbers. For instance, (1, /5, 2, V3) € R*. 

The set R” so defined is known as euclidean n-space, and its elements, which we 
write aS X = (x1, x2,..., Xn), are Known as vectors or n-vectors. By setting n = 1, 2, 
or 3, we recover the line, the plane, and three-dimensional space, respectively. 

Welaunch our study of euclidean n-space by introducing several algebraic operations. 
These are analogous to those introduced in Section 1.1 for R? and R3. The first two, 
addition and scalar multiplication, are defined as follows: 


(i) (x1, X2, eenn) + (Yi Ya ea Ya) = (x1 + yr x2 + ya, see Xn +y); 
and 


(ii) for any real number ox, 


ON X2, <- gy) = (Oy Od; «024 OMA): 


The geometric significance of these operations for R? and IR? was discussed in 
Section 1.1. 
Then vectors 


e =(1,0,0,...,0),e@ =(0,1,0,...,0),...,e¢,=(0,0,..., 0, 1) 


are called the standard basis vectors of IR”, and they generalize the three mutually 
orthogonal unit vectors i, j, kof R?. The vector x = (x1, x2, ..., xn) can then be written 
as X= x41 +x +- Xe. 

For two vectors x = (x1, x2, x3) and y = (1, y2, y3) in R?, we defined the dot or 
inner product X- y to be the real number X- Y = x1y1 + x2y2 + x3y3. This definition 
easily extends to R”; specifically, for X = (x1, x2,...,X%n), Y = (yi Ya ---1 Yn), WE 
define the inner product of xand yto be X- y= x11 + x2y2 +--+ + Xnyn. IN R”, the 
notation (x, y) is often used in place of x- yfor the inner product. 

Continuing the analogy with R?, we are led to define the notion of the length or norm 
of a vector xby the formula 


If xand yare two vectors in the plane (IR) or in space (IR), then we know that the 
angle 6 between them is given by the formula 
x-y 


d= : 
088 = TXY 
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The right side of this equation can be defined in R” as well as in R? or R?. It still 
represents the cosine of the angle between x and y; this angle is geometrically well 
defined, because xand ylie in a two-dimensional subspace of R” (the plane determined 
by xand y) and our usual geometry ideas apply to such planes. 

It will be useful to have available some algebraic properties of the inner product. 
These are summarized in the next theorem [compare with properties (i), (ii), (iii), and 
(iv) of Section 1.2]. 


Theorem 3 For x, y, Ze R” anda, £, real numbers, we have 


(i) («x+ By) -Z= a(x: z) + (y: 2). 
(ii) x-y=y-x 
(iii) x-x> 0. 


(iv) x- x= 0 if and only if x= Q 


proof Each of the four assertions can be proved by a simple computation. For ex- 
ample, to prove property (i) we write 


(ax+ By) -Z= (ax, + Byr ax, + By2, -- -r AXn + Byn) * (Z1, 22 -< -1 Zn) 
= (ax, + Byi)Z1 + (ax, + By2)z2 + +++ + (Xn + BYn)Zn 
= axyz1 + Byiz1 + x222 + By2zZ2 + -+ © + AXnZn + BYnZn 
=a(x-2) + p(y- 2. 


The other proofs are similar. E 


In Section 1.2, we proved an interesting property of dot products, called the Cauchy- 
Schwarz inequality.* For R? our proof required the use of the law of cosines. For R” we 
could also use this method, by confining our attention to a plane in R”. However, we 
can also give a direct, completely algebraic proof. 


Theorem 4 Cauchy-Schwarz Inequality in R” Letx, ybe vectors in R”. 
Then 


proof Leta = y-yand b = —X- y. If a = 0, the theorem is clearly valid, because 
then y = Oand both sides of the inequality reduce to 0. Thus, we may suppose a Æ 0. 


4Sometimes called the Cauchy-B unyakovskii- Schwarz inequality, or simply the CBS inequality, be- 
cause it was independently discovered in special cases by the French mathematician Cauchy, the Russian 
mathematician Bunyakovskii, and the German mathematician Schwarz. 
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example | 


solution 


By Theorem 3 we have 


0 < (ax+ by) -(ax+ by) = a°X- X+ 2abx-y+b’y-y 
= (y- yx: x= (ys W(x: y)’. 
Dividing by y- y gives 0 < (y-y)(x-x) — (x- y)2, that is, (x-y)? < (x-»(y-y) = 
\|xI|7{lyil?. Taking square roots on both sides of this inequality yields the desired 
result. E 


Thereis a useful consequence of the Cauchy- Schwarz inequality in terms of lengths. 
The triangle inequality is geometrically clear in IR? and was discussed in Section 1.2. 
The analytic proof of the triangle inequality that we gave in Section 1.2 works exactly 
the same in R” and proves the following: 


Corollary Triangle Inequality in R” Let x, ybe vectors in R”. Then 


[x+ yl] < I] + IYI. 


If Theorem 4 and its corollary arewritten out algebraically, they become the following 


useful inequalities: 
n n 1/2 n 1/2 
Daas (52) ($) ; 
i=l i=l i=1 
E 1/2 a 1/2 i 1/2 
(S) <(3>¥) H£») 
i=l i=l 


i=1 


Let x= (1, 2, 0, —1) and y= (—1, 1, 1, 0). Verify Theorem 4 and its corollary in this 
case. 


IIx] = v12 + 22+ 02+ (-1)? = V6 
Iyl = V-I +P +P +e = v3 
x-y=1(—1)+2-1+0-1+(-1)0=1 
x+y= (0,3,1, —1) 
Ix+yl = V? +3 +P + (-1) = VI. 


Wecomputex: y= 1 < 4.24 ~ /6V3 = |XI| liyi], which verifies Theorem 4. Similarly, 
we can check its corollary: 


Ix+yi| = v11 ~ 3.32 
< 4.18 = 2.45 + 1.73 ~ V6 + v3 = |X + Iyl- a 


By analogy with R3, we can define the notion of distance in R”; namely, if xand y 
are points in R”, the distance between xand yis defined to be ||x— yj, or the length of 
the vector x— y. We do not attempt to define the cross product on R” except for n = 3. 


example 2 


example 3 
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General Matrices 


Generalizing 2 x 2 and 3 x 3 matrices (see Section 1.3), we can consider m x n matrices, 
which are arrays of mn numbers: 


a1 412 Aln 

421 an2 An 
A= . 

Amı Am2 *** Amn 


We shall also write A as [a;]. We define addition and multiplication by a scalar 
componentwise, just as we did for vectors. Given two m x n matrices A and B, wecan 
add them to obtain a new m x n matrix C = A + B, whose ij th entry c; is the sum of 
a, and bj. Itis clear that A + B = B+ A. 


(b) [1 2]+[0 -1]=[1 1. 
21) fl o} fi i 
(c) f -b eli if A 


Given a scalar à and an m x n matrix A, we can multiply A by à to obtain a new 
m x n matrix AA = C, whose ijth entry c; is the product Aa;. 


1 -1 2 3 -3 6 
3/0 1 5;=/]0 3 15). 
1 0 3 3 0 9 A 


Next we turn to matrix multiplication. If A = [a,], B = [by] are n x n matrices, 
then the product AB = C has entries given by 


n 
Cij = X dikbkj 
k=1 


which is the dot product of the ith row of A and the jth column of B: 


jth column 
Ho Ap | [bire | by || e bin 
ith row ————> |G = |an : Ain 
: et 
an1 faa Ann brie | bnj yee bnn 
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example 4 


example 5 


example 6 


Let 
1 0 3 010 
A= |2 1 0 and B=]|1 0 0 
1 0 0 0 1 1 
Then 
0 4 3 2. 1 0 
AB= |1 2 0 and BA= |1 0 3 
0 1-0 3 1 0 


Observe that AB + BA. A 


Similarly, we can multiply an m x n matrix (m rows, n columns) by an n x p matrix 
(n rows, p columns) to obtain an m x p matrix (m rows, p columns) by the same rule. 
Note that for AB to be defined, the number of columns of A must equal the number of 
rows Of B. 


Let 
10 2 
ai a S| and B=1|10 2 1 
T 1 1 
Then 
3 15 
aB=|; 4 I 


and BA is not defined. A 


Let 
1 
2 
A= 1 and B=[2 2 1 2j: 
3 
Then 
2 2 1 2 
4 424 
AB = 22 12 and BA = [13]. 
6 6 3 6 A 


Any m x n matrix A determines a mapping of R” to R” defined as follows: Let 
X= (x1,..., Xn) € R”; consider then x 1 column matrix associated with x, which we 
shall temporarily denote x" 


example 7 
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and multiply A by x’ (considered to be an n x 1 matrix) to get anew m x 1 matrix: 


ail ERR Aln X1 yı 
AX — x — — y‘, 
Am1 >t Amn Xn Ym 
corresponding to the vector y = (y1,..., Ym). Thus, although it may cause some 
confusion, we will write x= (x1,...,x,) and y= (y1, ..., Ym) aS column matrices 
xı yı 
x= , y= 
Xn Ym 


when dealing with matrix multiplication, that is, we will identify these two forms of 
writing vectors. Thus, we will delete the T on x’ and view x’ and xas the same. 

Thus, Ax = ywill “really” mean the following: W rite xas a column matrix, multiply 
it by A, and let y be the vector whose components are those of the resulting column 
matrix. The rule x —> Ax therefore defines a mapping of R” to R”. This mapping is 
linear; thatis, it satisfies 


A(x+ y) = AX+ Ay 
A(ax) = a( AX), a a scalar, 


as may be easily verified. One learns in a linear algebra course that, conversely, any 
linear transformation of R” to R” is representable in this way by an m x n matrix. 

If A = [a;;] is an m x n matrix and ẹ is the jth standard basis vector of R”, then 
Ae is a vector in R” with components the same as the jth column of A. That is, the 
ith component of Ae; is a;;. In symbols, (A6); = aij. 


If 


1 
-1 
2 
-1 


A= 


Nr OO 
RN Fr Ww 


then x> Axof R? to R4 is the mapping defined by 


A xı + 3x3 
: —X1 + X3 
X2 
: 2x1 + x2 + 2x3 
i —x1 + 2x2 + x3 A 
>To use a matrix A to get a mapping from vectors X = (x,...,x,) to vectors y= (¥1,..., Yn) 
according to the equation Ax’ = y”, we write the vectors in the column form x’ instead of the row 
form (x1, ..., Xn). This sudden switch from writing xas a row to writing xas a column is necessitated 


by standard conventions on matrix multiplication. 
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example 8 


example 9 


The following illustrates what happens to a specific point when mapped by a 4 x 3 
matrix: 


= 2nd column of A. 


e NON 
N Ww A O 


C | 

or oO 

a | 
II 

e NAN 

> 


Properties of Matrices 


M atrix multiplication is not, in general, commutative: If A and B are n x n matrices, 
then generally 


AB & BA, 


as Examples 4, 5, and 6 show. 
Ann x n matrix is said to be invertible if there is an n x n matrix B such that 


AB = BA = I, 
where 
100. 0 
010 0 
p= | 0 1 0 
000 c 1 


is the n x n identity matrix: 7, has the property that 7,C = CI, = C foranyn xn 
matrix C. We denote B by A~} and call A~? the inverse of A. The inverse, when it 
exists, is unique. 


If 


240 [4-8 4 
A=1|0 2 1j, then Ss, 3 4 -2], 
3 0 2 Ole 12 4 


because AA~? = 73 = A~?A, as may be checked by matrix multiplication. A 


M ethods of computing inverses are learned in linear algebra; we won't require these 
methods in this book. If A is invertible, the equation Ax = ycan be solved for the vector 
xby multiplying both sides by A~? to obtain® x = A~ty. 

In Section 1.3 we defined the determinant of a 3 x 3 matrix. This can be generalized 
by induction to n x n determinants. We illustrate here how to write the determinant of 


5!n fact, Cramer's rule from Section 1.3 provides one way to invert matrices. Numerically more efficient 
methods based on elimination methods are learned in linear algebra or computer science. 


example 10 


solution 
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a4 x 4 matrix in terms of the determinants of 3 x 3 matrices: 


411 412 413 14 
421 422 423 A24 


a22 A23 424 a21 423 424 


= 411 |432 433 434| — a12 |431 433 434 
431 432 433 434 
a42 A43 ay a4, a43 ay 
a4) Ap A43 A44 
a2) an 44 a21 an 423 
+a |431 432 434| — a14 |431 432 433 
a4, Ap Ay a4, Ap 443 


[see formula (2) of Section 1.3; the signs alternate +, —, +, —]. 

The basic properties of 3 x 3 determinants reviewed in Section 1.3 remain valid for 
n xn determinants. In particular, we note the fact thatif A isan n x n matrix and B isthe 
matrix formed by adding a scalar multiple of one row (or column) of A to another row 
(or, respectively, column) of A, then the determinant of A is equal to the determinant of 
B (see Example 10). 

A basic theorem of linear algebra states that an n x n matrix A is invertible if and 
only if the determinant of A is not zero. A nother basic property is that the determinant is 
multiplicative: det (AB) = (det A)(det B). In this text, we shall not make use of many 
details of linear algebra, and so we shall leave these assertions unproved. 


Let 


eNe 
a 
oore 
Neno 


Find det A. Does A have an inverse? 


Adding (—1) x first column to the third column, we get 


1 00 
det A=|1 -2 o= i| =-2 
TEE 


Thus, det A = —2 40, and so A has an inverse. A 


If we have three matrices A, B, and C such that the products AB and BC are 
defined, then the products (AB)C and A( BC) are defined and are in fact equal (i.e., 
matrix multiplication is associative). We call this the triple product of matrices and 
denote it by ABC. 
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example 1] Let 


5 2 
Then 
3 9 
asc=atac=[3]a1=[3]. 
example 12 2 0] fl 1] f0 -1) f2 O] fl 0) [2 0 
0 1/1 i) /1 1) JO 1} }1 0| J1 0 A 


The founder of modern (coordinate) geometry was René Descartes (see 

Figure 1.5.1), a great physicist, philosopher, and mathematician, as well as a 

founder of modern biology. 
Historical Note Born in Touraine, France, in 1596, Descartes had a fascinating life. After 
studying law, he settled in Paris, where he developed an interest in mathematics. 
In 1628, he moved to Holland, where he wrote his only mathematical work, La 
Geometrie, one of the origins of modern coordinate geometry. 

Descartes had been highly critical of the geometry of the ancient Greeks, with 
all their undefined terms and with their proofs requiring ever newer and more 
ingenious approaches. For Descartes, this geometry was so tied to geometric 
figures “that it can exercise the understanding only on condition of greatly 
fatiguing the imagination.” He undertook to exploit, in geometry, the use of 
algebra, which had recently been developed. The result was La Geometrie, 
which made possible analytic or computational methods in geometry. 

Remember that the Greeks were, like Descartes, philosophers as well as 
mathematicians and physicists. Their answer to the question of the meaning of 
space was “Euclidean geometry.” Descartes had therefore succeeded in 
“algebrizing” the Greek model of space. 

Gottfried Wilhelm Leibniz, cofounder (with Isaac Newton) of calculus, was also 
interested in “space analysis,” but he did not think that Descartes’s algebra went 


figure 1.5.1 René Descartes 
(1596-1650). far enough. Leibniz called for a direct method of space analysis (analysis situs) 


that could be interpreted as a call for the development of vector analysis. 
On September 8, 1679, Leibniz outlined his ideas in a letter to Christian 
Huygens: 


I am still not satisfied with algebra, because it does not give the shortest 
methods or the most beautiful constructions in geometry. This is why | 
believe that, so far as geometry is concerned, we need still another 
analysis which is distinctly geometrical or linear and which will express 
situation (situs) directly as algebra expresses magnitude directly. And | 
believe that | have found the way and that we can represent figures 
and even machines and movements by characters, as algebra 
represents numbers or magnitudes. | am sending you an essay which 
seems to me to be important. 


In the essay, Leibniz described his ideas in greater detail. 


exercises 
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L Calculate the dot product of x= (1, —1, 0, 2) € R4 and 
y= (1,2, 3,4) € Rf. 
2 In R” show that 


(a) 21x? + 21y? = x+ yl? + lx- yil? (This is 
known as the parallelogram law.) 


(b) x — yilllx+ Il < Ixl? + Wy? 


(c) 4% y) = X+ yll? — IIx— yll? (This is called the 
polarization identity. ) 


Interpret these results geometrically in terms of the parallelogram formed by xand y. 


Verify the Cauchy—Schwarz inequality and the triangle inequality for the vectors in Exercises 3 to 6. 


3 x= (2,0, —1), y= (4, 0, —2) 

4. x= (1,0, 2, 6), y= (3, 8, 4, 1) 

5. x= (1, —1, 1, -1, 1), y= (3, 0, 0, 0, 2) 
6& x= (1,0, 0,1), y= (—1, 0, 0, 1) 


7. Letv, We R”. If |v) = |wi], show that v+ wand 
v— ware orthogonal. 


8. Suppose T is a triangle formed by placing three points 
on a circle, two of which lie on the circle’s diameter. Use 
the previous problem to show T is a right triangle. 


9. Compute AB, det A, det B, det (AB), and det (A + B) 


for 
1 -1 0 —2 0 2 
A=]|0 32 and B=]-1 1 -1)]. 
3 I | 1 4 3 
10. Compute AB, det A, det B, det (AB), and det (A + B) 
for 
3 0 1 1 0 -1 
A=|1 2 -1 and B=|2 0 1}. 
10 1 01 0 


In Exercises 16 to 18, A, B, and C denote n x n matrices. 


16. Is det (A+ B) = det A + det B? Give a proof or 
counterexample. 


17. Does (A + B)(A — B) = A? — B2? 


18 Assuming the law det (AB) = (det A)(det B), prove 
that det (ABC) = (det A)(det B)(det C). 


19. (This exercise assumes a knowledge of integration of 
continuous functions of one variable.) N ote that the 
proof of the Cauchy- Schwarz inequality (Theorem 4) 


11. Determine which of the following matrices are 
invertible: 


123 003 11 
A=]|011 B= |—1119 gf iil 
033 23a 
12. For matrix A in the previous problem, find a nonzero 
x€ R? such that Ax= Q 


13. Use induction on k to prove that if x, ..., X € R”, then 


XL eb Kel] S WAN Pe |e 


14. Using algebra, prove the identity of Lagrange: For real 
numbers x1,...,X, and y1,..., Yn- 


Goris pce. 


i] i=1 i=1 i<j 


Use this to give another proof of the Cauchy- Schwarz 
inequality in R”, 


15. Provethatif Aisan n x n matrix, then 


(a) det (AA) = A” det A; and 


(b) if B is a matrix obtained from A by multiplying any 
row or column by a scalar A, then det B = à det A. 


depends only on the properties of the inner product 
listed in Theorem 1. U se this observation to establish the 
following inequality for continuous functions 

f, g: [0,1] > R: 


1 1 1 
| f(x)g(x) dx} < i [f(x)]? dx Ji le(x)]? dx. 
0 0 0 


Do this by 


(a) verifying that the space of continuous functions from 
[0, 1] to IR forms a vector space; that is, we may think of 


The Geometry of Euclidean Space 


functions f, g abstractly as “vectors” that can be added 
to each other and multiplied by scalars. 


(b) introducing the inner product of functions 


1 
s=] F(x)g(x) dx 
0 


and verifying that it satisfies conditions (i) to (iv) of 
Theorem 3. 


20. Define the transpose AT of an n x n matrix A as 
follows: the ijth element of AT is aji where a;; is the 
ijth entry of A. Show that A” is characterized by the 
following property: For all x yin R”, 


(ATX) -y= X: (Ay). 
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22. 


Verify that the inverse of 


CER 
cod ad—bc|—c al’ 


Use your answer in Exercise 21 to show that the solution 
of the system 

ax+by =e 

cxt+dy=f 


G] = = E E P 
Assuming the law det (AB) = (det A)(det B), verify 


that (det A) (det A~!) = 1 and conclude that if A has an 
inverse, then det A 4 0. 


Find two 2 x 2 matrices A and B such that AB = 0 but 
BA #0. 


L Let v= 3i+4j+5kand w=i—j+k Compute 
v+ W, 3v, 6v+ 8w, —2v, v- w, v x w Interpret 
each operation geometrically by graphing the 
vectors. 


2. Repeat Exercise 1 with v= 2j + kand w= —i—-k 


3. (a) Find the equation of the line through (—1, 2, —1) in 
the direction of j. 


(b) Find the equation of the line passing through 
(0, 2, —1) and (—3, 1, 0). 


(c) Find the equation for the plane perpendicular to the 
vector (—2, 1, 2) and passing through the point 
(1,1, 3). 


4. (a) Find the equation of the line through (0, 1, 0) in the 
direction of 3i + k. 


(b) Find the equation of the line passing through 
(0, 1, 1) and (0, 1, 0). 


(c) Find an equation for the plane perpendicular to the 
vector (—1, 1, —1) and passing through the point 
(ERN: 


5. Find an equation for the plane containing the points 
(2, 1, —1), (3, 0, 2), and (4, —3, 1). 


9. 


Find an equation for a line that is parallel to the plane 
2x — 3y + 5z — 10 = 0 and passes through the point 
(—1, 7, 4). (There are lots of them.) 


a Compute v- wfor the following sets of vectors: 


(a) v=—-i+j;w=k 
(b) v=i+2j-kw=3i+j 
(c) v=-—2i-j+k w= 3i + 2j- 2k 


Compute v x wfor the vectors in Exercise 7. [Only 
part (b) is solved in the Study Guide.] 


Find the cosine of the angle between the vectors in 
Exercise 7. [Only part (b) is solved in the Study Guide.] 


Find the area of the parallelogram spanned by the vectors 
in Exercise 7. [Only part (b) is solved in the Study 
Guide.] 


Use vector notation to describe the triangle in space 
whose vertices are the origin and the endpoints of 
vectors aand b. 


Show that three vectors a, b, clie in the same plane 
through the origin if and only if there are three scalars 
a, B, y, not all zero, such that ~a + Bb+ yc= Q 


17. 


For real numbers a1, a2, a3, b1, b2, b3, show that 
(ayby + azb + a3b3)? < (a? + a3 + aĝ)(b + b$ + b3). 


Let u, v, whe unit vectors that are orthogonal to each 
other. If a= u+ BV+ yw, show that 


a=a-u, pray, y=aw 


Interpret the results geometrically. 


Find the products AB and BA where 


15 2 2 0 1 
a= fo 2 | a= |: 3 Jl 
10 2 24 1 


Find the products AB and BA where 


2 12 3.0 5 
ali o | a= [1 > i. 
1 3 0 03 1 


Leta bbe two vectors in the plane, 

a= (a1, a2), b= (bı, b2), and let à be a real number. 
Show that the area of the parallelogram determined by a 
and b+ Aais the same as that determined by aand b. 
Sketch. Relate this result to a known property of 
determinants. 


Find the volume of the parallelepiped determined by the 
vertices (0, 1, 0), (1, 1, 1), (0, 2, 0), (3, 1, 2). 


Given nonzero vectors aand bin R3, show that the 
vector v = ||alib+ |ibjja bisects the angle between a 
and b. 


Show that the vectors ||b|ja+ |/al/band |b|a-— |jal|bare 
orthogonal. 


Use the triangle inequality to show that 
Iv- wi > [ivi — iwi. 


Use vector methods to prove that the distance from the 
point (x1, y1) to the line ax + by = c is 


jax1 + by1 — c| 


Va? +b? 


23. Verify that the direction of b x cis given by the 


right-hand rule, by choosing b, cto be two of the vectors 
i, j, and k. 


(a) Suppose a- b= a - bfor all b. Show that a = a. 


(b) Suppose ax b= a x bfor all b. Is it true that 
a=a? 
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25. (a) Using vector methods, show that the distance 


between two nonparallel lines /; and /2 is given by 


_ (v — vı) - (a x @&)| 
la x æl 


d 


where vı, W are any two points on /; and /2, respectively, 
and a and & are the directions of Z1 and /2. [HINT: 
Consider the plane through /2 that is parallel to 7. Show 
that the vector (a x @)/||a@, x @|| iS a unit normal for 
this plane; now project wz — vi onto this normal 
direction.] 


(b) Find the distance between the line /1 determined by 


the points (—1, —1, 1) and (0, 0, 0) and the line /2 
determined by the points (0, —2, 0) and (2, 0, 5). 


Show that two planes given by the equations 
Ax + By + Cz + Dı = Qand Ax + By+Cz+ D2 =0 
are parallel, and that the distance between them is 


|D1 — D2| 


VA? + BE+ C2 


» (a) Prove that the area of the triangle in the plane with 


vertices (x1, y1), (x2, y2), (x3, y3) is the absolute 
value of 


(b) Find the area of the triangle with vertices 
(1, 2), (0, 1), (—1, 1). 


Convert the following points from Cartesian to 
cylindrical and spherical coordinates and plot: 


(a) (0, 3, 4) (d) (—1,0, 1) 
(b) (—4/2, 1, 0) (e) (—2v3, —2, 3) 
(c) (0, 0, 0) 


Convert the following points from cylindrical to 
Cartesian and spherical coordinates and plot: 


(a) (1, 2/4, 1) 
(b) (3, 2/6, —4) 
(c) (0, 2/4, 1) 


(d) (2, —x/2, 1) 
(e) (—2, —2/2, 1) 


Convert the following points from spherical to Cartesian 
and cylindrical coordinates and plot: 


(a) (1, 2/2, x) 
(b) (2, —2/2, 2/6) 
(c) (0, 2/8, 2/35) 


(d) (2, —2/2, =x) 
(e) (—1, x, 2/6) 
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BL Rewrite the equation z = x? — y? using cylindrical and 


spherical coordinates. 


32. Using spherical coordinates, show that 


38. 


39. 


$ = cos! (=), 


where u = xi + yj + zk Interpret geometrically. 
Verify the Cauchy-Schwarz and triangle inequalities for 
x= (3, 2,1, 0) and y=(1,1,1,2). 


Multiply the matrices 


3 0 1 101 
A=|2 0 1 and B= |l LL. 
101 0 0 1 


Does AB = BA? 


(a) Show that for two n x n matrices A and B, and 
xe R”, 


(AB)x= A(BX. 


(b) W hat does the equality in part (a) imply about the 
relationship between the composition of the 
mappings X+> BX, yr> Ay, and matrix 
multiplication? 


Find the volume of the parallelepiped spanned by the 
vectors 
(1,0, 1), 


(1,1,1), and (—3, 2, 0). 


«a (For students with some knowledge of linear algebra.) 


Verify that a linear mapping T of R” to R” is determined 
by ann x n matrix. 


Find an equation for the plane that contains (3, —1, 2) 
and the line with equation v= (2, —1, 0) + r(2, 3, 0). 


The work W done in moving an object from (0, 0) to (7, 
2) subject to a constant force Fis W = F-r, whereris 
the vector with its head at (7, 2) and tail at (0, 0). The 
units are feet and pounds. 


(a) Suppose the force F = 10 cos 6i + 10 sin 6j. Find 
W in terms of 8. 


(b) Suppose the force F has magnitude of 6 Ib and 


makes an angle of 2/6 rad with the horizontal, 
pointing right. Find W in foot-pounds. 


40. If a particle with mass m moves with velocity v, its 


momentum iS P= mv. In a game of marbles, a marble 
with mass 2 grams (g) is shot with velocity 2 meters per 
second (m/s), hits two marbles with mass 1 g each, and 
comes to a dead halt. One of the marbles flies off with a 
velocity of 3 m/s at an angle of 45° to the incident 
direction of the larger marble, as in Figure 1.R.1. 
Assuming that the total momentum before and after the 
collision is the same (according to the law of 
conservation of momentum), at what angle and speed 


does the second marble move? 
3 msa 
I 1g 714 
AS aes 


o! 
~> 


$ A 
29 J\N 
lg 


figure 1.R.1 Momentum and marbles. 


2m/s 


Show that for all x, y, z, 


x+2 y z y x+2 z 
z y+1 10)/=-]1 z-x-2 10-z 
5 5 2 5 5 2 
Show that 
l x x 
1 y y? |#0 
1 z z 


if x, y, and z are all different. 


Show that 
66 628 246 68 627 247 
88 435 24|/=|86 436 23l. 
2 -1 1 2 =1 l 
Show that 
n n+1 n+2 
n+3 n+4 n+5 
n+6 n+l] n+8 


has the same value no matter what n is. What is this 
value? 


45. Are the following quantities vectors or scalars? 


(a) The current population of Santa Cruz, California 
(b) The torque a cyclist exerts on her bicycle 


(c) The velocity of wind blowing through a weather 
vane 


(d) The temperature of a pizza in an oven 


46. Find a4 x 4 matrix C such that for every 4 x 4 matrix A 
we have CA = 3A. 


1'i 
a= [o 1l 
(a) Find A71, B71, and (AB)-1. 


(b) Show that (AB)~! 4 A~!B-! but 
(AB)! = Bla} 


47. Let 


Use the following definition for Exercises 50 and 51: Lett, .. 
center of mass is the vector 


50. A tetrahedron sits in xyz coordinates with one vertex at 
(0, 0, 0), and the three edges concurrent at (0, 0, 0) are 
coincident with the vectors a, b, c 


(a) Draw a figure and label the heads of the vectors 
abc 


(b) Find the center of mass of each of the four triangular 
faces of the tetrahedron if a unit mass is placed at 
each vertex. 


In Exercises 52 to 57, find a unit vector that has the given property. 


52. Parallel to the line x = 3r+1, y = 16t — 2, 
z=—(t+2) 


53. Orthogonal to the plane x — 6y + z = 12 


54. Parallel to both the planes 8x + y + z = 1 and 


K—yr-2= 


., F, be vectors in R3 from 0 to the masses mı, .. 
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48. Suppose ° A is invertible and has integer entries. 


=l 
What conditions must be satisfied for A = k r 
to have integer entries? 

49. The volume of a tetrahedron with concurrent edges a b, 
cis given by V = ła. (bx ©. 


(a) Express the volume as a determinant. 


(b) Evaluate V when 
a=i+j+k b=i-j+kc=i+j. 


., Mn. The 


5L Show that for any vector r, the center of mass of a 
system satisfies 


n n 


So millr— rl? = So mlr — cl? + mlr- e’, 


i=1 i=l 


where m = )~"_, m; is the total mass of the system. 


55. Orthogonal to i+ 2j — Kand to k 


56. Orthogonal to the line 
x = 2t — l, y = —t — 1, z = t + 2, and the vector i — j 


57. Atan angle of 30° to i and making equal angles with j 
and k 
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Differentiation 


I turn away with fright and horror from the lamentable evil of functions which do not have 


derivatives. —Charles Hermite, 
in a letter to Thomas Jan Stieltjes 


This chapter extends the principles of differential calculus for functions 
of one variable to functions of several variables. We begin in Section 2.1 
with the geometry of real-valued functions and study the graphs of these 
functions as an aid in visualizing them. Section 2.2 gives some basic def- 
initions relating to limits and continuity. This subject is treated briefly, be- 
cause it requires time and mathematical maturity to develop fully and is 
therefore best left to a more advanced course. Fortunately, a complete 
understanding of all the subtleties of the limit concept is not necessary for 
our purposes; the student who has difficulty with this section should bear 
this in mind. However, we hasten to add that the notion of a limit is cen- 
tral to the definition of the derivative, but not to the computation of most 
derivatives in specific problems, as we already know from one-variable 
calculus. Sections 2.3 and 2.5 deal with the definition of the derivative, 
and establish some basic rules of calculus: namely, how to differentiate 
a sum, product, quotient, or composition. In Section 2.6, we study direc- 
tional derivatives and tangent planes, relating these ideas to those in 
Section 2.1. Finally, the Internet supplement gives some of the technical 
proofs. 

In generalizing calculus from one dimension to several, it is offen con- 
venient to use the language of matrix algebra. What we shall need has 


been summarized in Section 1.5. 


75 


76 Differentiation 


2.1 The Geometry of Real-Valued Functions 


We launch our investigation of real-valued functions by developing methods for visual- 
izing them. In particular, we introduce the notions of a graph, a level curve, and a level 
surface of such functions. 


Functions and Mappings 


Let f be a function whose domain is a subset A of R” and with a range contained 
in R™. By this we mean that to each x = (X1, ..., Xn) € A, f assigns a value f(x), 
an m-tuple in IR. Such functions f are called vector-valued functions! if m > 1, and 
scalar-valued functions if m = 1. For example, the scalar-valued function f(x, y, z) = 
(x? + y? + 22)-3/? maps the set A of (x, y, z) # (0, 0, 0) in R? (n = 3, in this case) to 
R (m = 1). To denote f we sometimes write 


F(X, yz) Be (x? + y? +27) 37, 
Note that in IR? we often use the notation (x, y, z) instead of (x1, X2, X3). In general, the 
notation x +> f(x) is useful for indicating the value to which a point x € R” is sent. 
Wewrite f: A c R" — R” to signify that A is the domain of f (a subset of IR") and the 
range is contained in R™. We also use the expression f maps A into R™. Such functions 
f are called functions of several variables if A c R",n > 1. 

As another example we can take the vector-valued function g: Rê — R? defined by 
the rule 


Q(x) = 9(X1, X2, X3, Xa, X5, X6) = (x1X2X3X4X5X6, Vx? + x2). 


The first coordinate of the value of g at x is the product of the coordinates of x. 
Functions from R” to R™ are not just mathematical abstractions, they arise natu- 
rally in problems studied in all the sciences. For example, to specify the temperature 
T in a region A of space requires a function T: A c R? > R (n = 3,m = 1); 
thus, T(x, y, z) is the temperature at the point (x, y, z). To specify the velocity of 
a fluid moving in space requires a map V: Rt —> R?, where V(x, y, z, t) is the ve- 
locity vector of the fluid at the point (x, y, z) in space at time t (see Figure 2.1.1). 
To specify the reaction rate of a solution consisting of six reacting chemicals A, B, 
C,D,E,F in proportions x, y, z,w, u, v requires a map o: U c Rê —> R, where 
o(X, Y, z, W, u, v) gives the rate when the chemicals are in the indicated proportions. 


(XY wx y, z0 = Fluid velocity o l 
figure 2.1.1 A fluid in motion defines 


————— a vector field V by specifying the 
velocity of the fluid particles at each 
_ _— point in space and time. 


1Some mathematicians would write such an f in boldface, using the notation f(x), because the function 
is vector-valued. We did not do so, as a matter of personal taste. We use boldface primarily for mappings 
that are vector fields, introduced later. The notion of function was developed over many centuries, with 
the definition extended to cover more cases as they arose. For example, in 1667 J ames Gregory defined 
a function as “a quantity obtained from other quantities by a succession of algebraic operations or 
by any other operation imaginable.” In 1755 Euler gave the following definition: “If some quantities 
depend on others in such a way as to undergo variation when the latter are varied then the former are 
called functions of the latter.” 
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To specify the cardiac vector (the vector giving the magnitude and direction of electric 
current flow in the heart) at time t requires a map c: R > R?, t > c(t). 

When f: U c R” — R, we say that f is a real-valued function of n variables with 
domain U . The reason we say “n variables” is simply that we regard the coordinates 
of a point X = (X1, ..., Xn) E U asn variables, and f(x) = f(xi,..., Xn) depends on 
these variables. We say “real-valued” because f (X1, ..., Xn) Is a real number. A good 
deal of our work will be with real-valued functions, so we give them special attention. 


Graphs of Functions 


For f: U C R —> R(n = 1), the graph of f is the subset of R? consisting of all points 
(x, f(x)) in the plane, for x in U . This subset can be thought of as a curve in R?. In 
symbols, we write this as 


graphf = {(x, f(x)) € R?| x €U}, 


where the curly braces mean “the set of all” and the vertical bar is read “such that.” 
Drawing the graph of a function of one variable is a useful device to help visualize how 
the function actually behaves (see Figure 2.1.2). It will be helpful to generalize the idea 
of a graph to functions of several variables. This leads to the following definition: 


Definition Graph of a Function Let f:U c R" —> R. Define the graph 
of f to be the subset of IR"** consisting of all the points 


(SG coon Mao T On ooen Mal) 


in R+! for (xz, ..., Xn) in U . In symbols, 


Graph i = Wa ooon te Pp cen) SR ine ene UN. 


For the case n = 1, the graph is a curve in R?, while for n = 2, itis a surface in R? 
(see Figure 2.1.2). For n = 3, itis difficult to visualize the graph, because, since we are 
humans living in a three-dimensional world, it is hard for us to envisage sets in R4. To 
help overcome this handicap, we introduce the idea of a level set. 


y Z 


Graph of f 


Graph of f 


(a) (b) 


figure 2.1.2 The graphs of (a) a function of one variable, and (b) a function of two variables. 
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figure 2.1.3 Level contours of a 
function are defined in the same 
manner as contour lines for a 
topographical map. 


example | 


example 2 


Level Sets, Curves, and Surfaces 


Suppose f(x, y,z) = x? + y? +22. A level set is a subset of IR? on which f is 
constant; for instance, the set where x? + y? + z? = 1 is a level set for f. This we 
can visualize: It is just a sphere of radius 1 in R?. Formally, a level set is the set of 
(x, y, Z) such that f (x, y, z) = c, where c is a constant. The behavior or structure of a 
function is determined in part by the shape of its level sets; consequently, understanding 
these sets aids us in understanding the function in question. L evel sets are also useful 
for understanding functions of two variables f(x, y), in which case we speak of level 
curves or level contours. 

The idea is similar to that used to prepare contour maps, where one draws lines to 
represent constant altitudes; walking along such a line would mean walking on a level 
path. In the case of a hill rising from the xy plane, a graph of all the level curves gives 
us a good idea of the function h(x, y), which represents the height of the hill at point 
(x, y) (see Figure 2.1.3). 


The constant function f: R? —> R, (x, y) œ> 2—thatis, thefunction f (x, y) = 2— has 
as its graph the horizontal plane z = 2 in R?. The level curve of value c is empty if 
c Æ 2, and isthe whole xy plane ifc =2. A 


Thefunction f: R? —> R, defined by f(x, y) = x + y +2, has as its graph the inclined 
plane z = x + y + 2. This plane intersects the xy plane (z = 0) in the line y = —x — 2 
and the z axis at the point (0, 0, 2). For any value c € R, the level curve of value c is 
the straight line y = —x + (c — 2); or in symbols, the set 


Le = {(x, y) | yY =—x + (c — 2)} c R°. 


We indicate a few of the level curves of the function in Figure 2.1.4. This is a contour 
map of the function f. 


figure 2.1.4 The level curves of f(x, y= x+ y+2 
show the sets on which f takes a given value. 


Line of 
intersection of 
planez=x+y+2 
and the xy plane 


example 3 


solution 
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x+y+2=4 Level curves litted 
to surface 


xty+2=2 


Level curvex+y+2=0 
x+ty+2=2inxy plane 
x+y+2= 4in xy plane 


figure 2.1.5 The relationship of level curves of Figure 2.1.4 to the graph of 
the function f(x, Y) = x + y+ 2, which is the plane z= x+ y+ 2. 


From level curves labeled with the value or “height” of the function, the shape of the 
graph may be inferred by mentally elevating each level curve to the appropriate height, 
without stretching, tilting, or sliding it. If this procedure is visualized for all level curves, 
L.—that is, for all values c € R, they will assemble to give the entire graph of f, as 
indicated by the shaded plane in Figure 2.1.5. If the graph is visualized using a finite 
number of level curves, acontour model is produced. If f isasmooth function, its graph 
will be a smooth surface, and so the contour model, mentally smoothed over, gives a 
good impression of the graph. A 


Definition Level Curves and Surfaces Let f:U c R" > R and letc e 
R. Then the level set of value c is defined to be the set of those points x € 
U at which f(x) = c. If n = 2, we speak of a level curve (of value c); 
and if n = 3, we speak of a level surface. In symbols, the level set of value 
c is written 


{xeU | f(x) =c} CR’. 


Note that the level set is always in the domain space. 


Describe the graph of the quadratic function 
f: R? > R, (x, y) => Xy. 


The graph is the paraboloid of revolution z = x? + y?, oriented upward from the origin, 
around the z axis. The level curve of value c is empty for c < 0; for c > 0 the level 
curve of value c is the set {(x, y) | x? + y? = c}, a circle of radius „/€ centered at the 
origin. Thus, raised to height c above the xy plane, the level set is a circle of radius ./C, 
indicating a parabolic shape (see Figures 2.1.6 and 2.1.7). 


Differentiation 


example 4 


figure 2.1.6 Some level curves for the function 
f(x, Y= x*+ y. 


figure 2.1.7 Level curves in Figure 2.1.6 raised to 
the graph. 


The Method of Sections 


By a section of the graph of f we mean the intersection of the graph and a (vertical) 
plane. For example, if Pı is the xz plane in R?, defined by y = 0, then the section of f 
in Example 3 is the set 


Pin graph f = {(X, y, Z) | y=0,z=x°}, 


which is a parabola in the xz plane. Similarly, if P2 denotes the yz plane, defined by 
x = 0, then the section 


Pa N graph f = {(x, y,z) |x =0,z = ył} 
is a parabola in the yz plane (see Figure 2.1.8). Itis usually helpful to compute at least 
one section to complement the information given by the level sets. 
The graph of the quadratic function 
f: R? SR, (x, y) > x? — y? 
is called a hyperbolic paraboloid, or saddle, centered at the origin. Sketch the graph. 


solution 
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figure 2.1.8 Two sections of the graph 
of f(x, yY = x* + y?. 


To visualize this surface, MENTS ana the level curves. To determine the level curves, we 
Foyere equation x? — y? = c. Consider the values c = 0, +1, +4. Forc = 0, we have 
y? = x? or y = +X, SO thar tnis Ieva setconsists of two a ee 
Forc = 1, the level curve is x? — y? = 1, or y = +/x? — 1, which is a hyperbola that 
passes vertically through the x axis at the points (+1, 0) (see Figure 2.1.9). Similarly, 
forc = 4, the level curve is defined by y = +x? — 4, the hyperbola passing vertically 
through the x axis at (+2, 0). For c = —1, we obtain the curve x? — y? = —1— that is, 
X = +,/y2 — 1— the hyperbola passing horizontally through the y axis at (0, +1). And 
for c = —4, the hyperbola through (0, +2) is obtained. These level curves are shown 
in Figure 2.1.9. B ecause it is not easy to visualize the graph of f from these data alone, 
we shall compute two sections, as in the previous example. For the section in the xz 
plane, we have 


Pi graph of f = {(x, y, z) | y = 0, z = x?}, 
which is a parabola opening upward; and for the yz plane, 


Pa N graph f = {(x, y, z) |x = 0, z = =y°}, 


> x 


figure 2.1.9 Level curves for the function f(x, Y = x? — y. 
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example 5 


solution 


figure 2.1.10 Some level curves on the graph of 
f(x, Y = x? — y?, 


SS 


yaxis 


figure 2.1.11 The graph of z= x? — y? and its level curves. 


which is a parabola opening downward. The graph may now be visualized by lifting the 
level curves to the appropriate heights and smoothing out the resulting surface. Their 
placement is aided by computing the parabolic sections. This procedure generates the 
hyperbolic saddle indicated in Figure 2.1.10. Compare this with the computer- generated 
graphs in Figure 2.1.11 (note that the orientation of the axes has been changed). A 


Describe the level sets of the function 
f: R? >R, (x, y, Z2) X? +y? +z. 


This is the three-dimensional analogue of Example 3. In this context, level sets are 
surfaces in the three-dimensional domain IR?. The graph, in Rt, cannot be visualized 
directly, but sections can nevertheless be computed. 

The level set with value c is the set 


Le = {(x, yz) |X? +y? +z =c), 
which is the sphere centered at the origin with radius ./c for c > 0, is a single point at 


the origin for c = 0, and is empty for c < 0. The level sets for c = 0, 1, 4, and 9 are 
indicated in Figure 2.1.12. 
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xŽ+y? +=]? 
x+y? +e 

2 2 2.32 
eo Hy +43 


figure 2.1.12 Some level surfaces for 
fx, y, D=XŻ+y +z. A 


Describe the graph of the function f: R? —> R defined by f(x, y, z) = x? + y? — 2°, 
which is the three-dimensional analogue of Example 4, and is also called a saddle. 


Formally, the graph of f is a subset of four-dimensional space. If we denote points in 
this space by (x, y, z, t), then the graph is given by 


The level surfaces of f are defined by 
Le = {(x,y,2) |x? yac 


For c = 0, this is the cone z = +,/x2 + y? centered on the z axis. For c negative, 
say, c = —a?, we obtain z = +,/x2+ y? + a2, which is a hyperboloid of two sheets 
around the z axis, passing through the z axis at the points (0, 0, +a). For c positive, say, 
c = b?, the level surface is the single-sheeted hyperboloid of revolution around the 
z axis defined by z = +,/x* + y? — b?, which intersects the xy plane in the circle of 
radius |b|. These level surfaces are sketched in Figure 2.1.13. 


>N 


figure 2.1.13 Some level surfaces of the function f(x, y, D = x* + y? — z?. 
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figure 2.1.14 The y= 0 section of the graph of f(x, y, D = X + y? — z?. 


Another view of the graph may be obtained from asection. For example, the subspace 
Sy=0 = {(X, y, Z, t) | y = 0} intersects the graph in the section 


Sy-0 N graph f = {(x, y,z,t) | y=0,t =x? —z?}, 


that is, the set of points of the form (x, 0, z, x? — z*), which may be considered to bea 
surface in xzt space (see Figure 2.1.14). A 


yaxis 0 


-1 


-2 


-2 -1 0 1 2 
x axis 
(c) 


figure 2.1.15 Computer-generated graph of z = (x? + 3y?) exp (1 — x? — y?) represented in three 
ways: (a) by sections, (b) by level curves on a graph, and (c) by level curves in the xy plane. 
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We have seen how the methods of sections and level sets can be used to under- 
stand the behavior of a function and its graph; these techniques can be quite useful to 
people who desire comprehensive visualization of complicated data. There are many 
computer programs available to do this, and we show the results of one such program 
in Figure 2.1.15. 


1. Arethe following functions vector-valued or 2. Arethe following functions vector-valued or 


scalar-valued? 


(a) f(x, y, z) = e%z* siny 


scalar-valued? 


(a) f(u, v, w) = (u2v, we, 5v) 


(b) g(x, y) = (x?y?, 2x — 1) (b) g(x) = log /x 
(c) h(t) = (cost, sint, t2, t3) (c) h(x, y) = x°y73 


In the following two exercises, match the given level curves with their visual descriptions. 


3. (a) f(x,y) =x? -y =c, 


c=0,1,-1 (b) f(x,y) =2x24+3y2=c, c=6,12 


>x< 


(ii) (iii) 


(iv) (v) 
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4. (a) f(x,y)=(x-y)?=c, c=0,1,4 


(b) f(x,y) =(x+y)?=c, c=0,1,4 


y 
A 


t+—> X 


(iv) (v) 


. Draw the level curves for f of values c. 
(a) f(x,y) =x3-y, c=-1,0,1 
(b) f(x, y)=y—2logx, c = —3,0,3 
(c) f(x,y) =ycscx, c=0,1,2 
(d) f(x, y) =x/(x? + y?), c =—2, 0,4 


» Let f(x, y) = 9x? + y?. Sketch the following. 


(a) The level curves for f of values c = 0, 1,9 


(b) The sections of the graph of f in the planes 
X=-1x=0,x=1 


(c) The sections of the graph of f in the planes 
y=-l,y=0,y=1 
(d) The graph of f 
» Sketch the level curves and graphs of the following 
functions: 
(a) f: R? SR, (x,y) x-y4+2 
(b) f: R?— R, (x, y) > x? + 4y? 
(c) f: R? >R, (x, y) =œ —xy 


» Sketch level sets of values c = 0, 1, 4, 9 for both 
f(x, y) = x? + y? and g(x, y) = \/x2 + y2. How are 
the graphs of f and g different? How are their sections 
different? 


(vi) 


9. Let S be the surface in R? defined by the equation 


x2y6 — 27 = 3, 

(a) Find areal-valued function f(x, y, z) of three 
variables and a constant c such that S is the level set 
of f of valuec. 


(b) Find areal-valued function g(x, y) of two variables 
such that S is the graph of g. 


10. Describe the behavior, as c varies, of the level curve 


f(x, y) = c for each of these functions: 
(a) f(x,y) =X? +y? +1 

(b) f(x,y) =1-x?- y? 

(c) f(x,y) =x3—x 


11. For the functions in Examples 2, 3, and 4, compute the 


section of the graph defined by the plane 
So = {(x, y, Z) | y = x tan 0} 


for a given constant 6. Do this by expressing z as a 
function of r, where x = r cosé, y =r sin 98. 
Determine which of these functions f have the property 
that the shape of the section Sg N graph f is independent 
of 6. (The solution for Example 3 only is in the Study 
Guide.) 
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In Exercises 10 to 16, draw the level curves (in the xy plane) for the given function f and specified values of c. Sketch 
the graph of z = f(x, y). 


12. f(x, y) =4-— 3x 4+ 2y,c =0,1, 2,3, —1, —2, —3 16. f(x, y) =3x —7y,c = 0,1, 2, 3, —1, —2, -3 


13. f(x, y) =(100— x? — y’)1/2,c = 0, 2, 4, 6, 8, 10 17. f(x,y) =x? + xy, c = 0, 1,2, 3, -1, -2, -3 
14. f(x,y) =(x2 + y?)"2,c =0,1,2,3,4,5 18. f(x,y) =x/y,¢ = 0, 1,2, 3, -1, —2, —3 
15. f(x,y) =x2+y2,c =0,1,2,3,4,5 


In Exercises 17 to 19, sketch or describe the level surfaces and a section of the graph of each function. 


19. f: R? —> R, (x, y, Z) œ =x? - y? - 7? 21. f: R? > R, (x,y,z) x24 y? 


20. f: R3? > R, (x, y, Z) 4x2 + y? + 9z? 


In Exercises 20 to 24, describe the graph of each function by computing some level sets and sections. 


22. f: R? —> R, (x, y, z) > xy 25. f: R? —> R, (x,y) 6 ly! 


f: R? > R, (x, y, z) 


f: R? > R, (x, y, 2) 


> Xy + YZ 


> Xy +z? 


26. 


f: R? > R, (x, y) > max (|x|, Iyl) 


Sketch or describe the surfaces in R3 of the equations presented in Exercises 25 to 37. 


27. 4x? +y? =16 37. 4x2 — 3y? + 2z? = 0 
28. x+2z=4 2 y 
o, 38. oo or ge 
29, zt = y4 +4 
' 39. x2? +y? +22 + 4x — by + 9z — b = 0, where b is a 


30. 


x? +y? —2x=0 


constant 


x y z 40. Using polar coordinates, describe the level curves of the 
31. | ea a function defined by 
y2 z x2 De ie dy, 
ee es = ' ' pU 
32 r E f(x, y) = 2xy/(x* + y4) if (x, y) # (0, 0) and f (0, 0) = 0 
=? 41. Let f: R2\{0} — R be given in polar coordinates by 
33. 7=x 
f(r, é) = (cos 26) /r?. Sketch a few level curves in the 
34. y2422=4 xy plane. Here, R2\{0} = {x € R? |x 4 0}. 
y2 x? 42. Show that in Figure 2.1.15, the level “curve” z = 3 
35. z= TO consists of two points. 
36. y? =x? +7? 
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2.2 Limits and Continuity 


This section develops the concepts of open sets, limits, and continuity; open sets are 
needed to understand limits, and limits are in turn needed to understand continuity and 
differentiability. 

Asin elementary calculus, itis not necessary to completely master the limit concept 
in order to work problems in differentiation. For this reason, instructors may treat the 
following material with varying degrees of rigor. The student should consult with the 
instructor about the depth of understanding required. 


Open Sets 


We begin formulating the concept of an open set by defining an open disk. Let Xo € R’ 
and letr be a positive real number. The open disk (or open ball) of radius r and center 
Xo is defined to be the set of all points x such that |/x — Xoll < r. This set is denoted 
D, (Xo) and is the set of points x in IR" whose distance from xo is less than r . Notice that 
we include only those x for which strict inequality holds. The disk D, (Xo) is illustrated 
in Figure 2.2.1 for n = 1, 2, 3. For the case n = 1 and Xo € R, the open disk D, (Xo) is 
the open interval (Xo — r, Xo +1), which consists of all numbers x € R strictly between 
Xo — r and Xo +r. For the case n = 2, Xp € R?, D, (Xo) is the “inside” of the disk of 
radius r centered at Xo. For the case n = 3, Xọ € R?, D, (Xp) is the part strictly “inside” 
of the ball of radius r centered at Xo. 


Definition Open Sets LetU c R” (that is, let U be a subset of R"). We 
call U an open set when for every point Xọ in U there exists somer > 0 such 
that D, (Xo) is contained within U; symbolically, we write D,(Xọ) c U (see 
Figure 2.2.2). 


The number r > 0 can depend on the point Xp, and generally r will shrink as Xo gets 
closer to the “edge” of U . Intuitively speaking, a set U is open when the “boundary” 
points of U do not liein U . In Figure 2.2.2, the dashed line is not included in U . 


y 


| | 


(a) (b) (c) 


figure 2.2.1 What disks D, (Xo) look like in (a) one, (b) two, and (c) three dimensions. 


2.2 Limits and Continuity 89 


figure 2.2.2 An open set U is one that 
completely encloses some disk D, (Xo) about 
each of its points Xo. 


We establish the convention that the empty set Ø (the set consisting of no elements) is 
open. 

We have defined an open disk and an open set. From our choice of terms it would 
seem that an open disk should also be an open set. A little thought shows that this fact 
requires some proof. The following theorem does this. 


Theorem | For each Xọ € R" andr > 0, D, (Xo) is an open set. 


proof Letx e D,(Xo); that is, let ||x — Xoll < r. According to the definition of an 
open set, we must find an s > 0 such that D<(x) c D, (Xo). Referring to Figure 2.2.3, 
we see that s = r — ||x — Xo|| is a reasonable choice; note thats > 0, but that s becomes 
smaller if x is nearer the edge of D , (Xo). 

To prove that D;(x) c D,(Xo), let y € D(x); that is, let |y — x|| < s. We want 
to prove that y € D,(X9) as well. Proving this, in view of the definition of an r-disk, 
entails showing that |y — Xoll < r. This is done by using the triangle inequality for 
vectors in R": 


Iy — Xoll = My — x) + (x — Xo) || < ly — XI] + IX — Xoll < S + [IX — Xl] =r. 
Hence, ||y—Xo|| <r. m 


The following example illustrates some techniques that are useful in establishing the 
openness of sets. 


o 
wi 
¢ 
a 
F r 
i 
figure 2.2.3 The geometry of the 1 ` 
proof that an open disk is an 7 í i d = |x- xoll 
open set. `i Xo I Le ns Ri 
7 fos =r Ix-%0ll 
‘ 4 
s+ 
s af 
Ss p” 
Sa id 
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example | 


solution 


figure 2.2.5 The construction of a 
disk about a point in A that is 
completely enclosed in A. 


Prove that A = {(x, y) € R? | x > 0} is an open set. 


The set is pictured in Figure 2.2.4. 


figure 2.2.4 Show that A is an open set. 


ee B 


Intuitively, this set is open, because no points on the “boundary,” x = 0, arecontained 
in the set. Such an argument will often suffice after one becomes accustomed to the 
concept of openness. At first, however, we should give details. To prove that A is open, 
we show that for every point (x, y) € A there exists anr > 0 such that D, (x, y) CA. 
If (x, y) e A, then x > 0. Chooser = x. If (x1, yi) € D-(x, y), we have 


x1 — X| = vX — xP? < (x1 — x)? + (yi — y)? <r =x, 


and so x; — X < x and x — xX; < x. The latter inequality implies x; > 0, that is, 
(Xz, y1) € A. Hence D,(x, y) c A, and therefore A is open (see Figure 2.2.5), A 


It is useful to have a special name for an open set containing a given point x, 
because this idea arises often in the study of limits and continuity. Thus, by a 
For example, 
D, (Xo) isa neighborhood of Xo for any r > 0. The set A in Example 1 is a neighborhood 
of the point Xo = (3, —10). 


Boundary 


Let us formally introduce the concept of a boundary point, which we alluded to in 
Example 1. 


Definition Boundary Points Let A c R". A pointx € R" is called a bound- 
ary pointof A if every neighborhood of x contains at least one pointin A and at 
least one point notin A. 


In this definition, x itself may or may not bein A; if x € A, then x is a boundary point 
if every neighborhood of x contains at least one pointnotin A (it already contains a point 
of A, namely, x). Similarly, if xis notin A, itis a boundary point if every neighborhood 
of x contains at least one point of A. 

We shall be particularly interested in boundary points of open sets. By the definition 
of an open set, no point of an open set A can bea boundary point of A. Thus, a point x 


example 2 
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isa boundary point of an open set A if and only if xis notin A and every neighborhood 
of x has a nonempty intersection with A. 

This expresses in precise terms the intuitive idea that a boundary point of A isa 
point just on the “edge” of A. In many examples it is perfectly clear what the boundary 
points are. 


(a) Let A = (a, b) in R. Then the boundary points of A consist of the points a and b. 
A consideration of Figure 2.2.6 and the definition will make this clear. [The reader will 
be asked to prove this in Exercise 28(c).] 


Boundary points 
a YP N figure 2.2.6 The boundary points of the interval 
x (ab. 


a b 


(b) LetA = D,(Xo, Yo) beanr-disk about (Xo, Yo) inthe plane. The boundary consists 
of points (x, y) with (x — xo)? + (y — yo)? = r? (Figure 2.2.7). 


y 
Boundary 


figure 2.2.7 The boundary of A consists of points on 
the edge of A. 


A= D; (X, Yo) 


X 


(c) Let A = {(x, y) € R? | x > 0}. Then the boundary of A consists of all points on 
the y axis (draw a figure that depicts this). 


(d) Let A be D,(Xo) minus the point Xo (a “punctured” disk about Xo). Then Xo is a 
boundary pointof A. A 


Limits 
Wenow turn our attention to the concept of alimit. Throughout the following discussions 
We are interested in 
finding the limit of f asx € A approaches either a point of A or a boundary point of A. 
You should appreciate the fact that the limit concept is a basic and useful tool for the 
analysis of functions; it enables us to study derivatives, and hence maxima and minima, 
asymptotes, improper integrals, and other important features of functions, as well as 
being useful for infinite series and sequences. We will present a theory of limits for 
functions of several variables that includes the theory for functions of one variable as a 
special case. 


In one-variable calculus, you have encountered the notion of limitx—x f(x) =I for 
a function f: A C R > R froma subset A of the real numbers to the real numbers. 


Intuitively, this means that as x gets closer and closer to Xo, the values f(x) get closer 
and closer to (the limiting value) |. To put this intuitive idea on a firm, mathematical 
foundation, either the “epsilon (e) and delta (5) method” or the “neighborhood method” 
is usually introduced. The same is true for functions of several variables. In what follows 

The epsilon-delta approach is left for 
optional study at the end of this section. 
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example 3 


figure 2.2.8 Limits in terms of neighborhoods; If x 
isin U, then fO) will be in N. (The little open circle 
denotes that the point does not lie on the graph.) 
In the figure, f: A= {(x, Y | X + ¥ < 1} > R. (he 
dashed line is not in the graph of f.) 


Definition Limit Let f: A c R" = R", where A is an open set. Let xX be in 
A or be a boundary point of A, and let N be a neighborhood of b € R™. We say 
f is eventually in N as x approaches xo if there exists a neighborhood U of Xo 
such that x Æ Xo, x € U , and x € A imply f(x) € N. [The geometric meaning of 
this assertion is illustrated in Figure 2.2.8; note that xp need not bein the set A, so 
that f (Xo) is not necessarily defined.] We say f (x) approaches b as x approaches 
Xo, or, in symbols, 


limit f (x) = b or f(x) > b as X— Xp, 


when, given any neighborhood N of b, f is eventually in N as x approaches Xo 
[thatis, “ f (x) is close to b if x is close to Xo]. It may be that as x approaches Xo, 
the values f (x) do not get close to any particular vector. In this case, we say that 
limity+x, f (x) does not exist. 


Henceforth, whenever we consider the notion limity_.,, f (x), we shall always assume 
that Xo either belongs to some open set on which f is defined or is on the boundary of 
such a set. 

One reason we insist on x Æ Xo in the definition of limit will become clear if we 
remember from one-variable calculus that we want to be able to define the derivative 
f’(Xo) of a function f at a point x9 by 


£(%) = birmit 6X) = FOX) 
X —> X0 X — Xo 


and this expression is not defined at x = Xo. 


(a) Thisexampleillustrates alimitthat does not exist. Consider thefunction f: R —> R 


defined by 
1 ifx >0 
nedi ifx <0. 
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The limit,_.o f (x) does not exist, since there are points x, arbitrarily close to 0 with 
f (xı) = 1 and also points xz arbitrarily close to 0 with f (x2) = —1; that is, there is no 
single number that f is close to when x is close to 0 (see Figure 2.2.9). If f is restricted 
to the domain (0, 1) or (—1, 0), then the limit does exist. Can you say why? 


figure 2.2.9 The limit of this function 
as x — 0 does not exist. 


figure 2.2.10 The limit of this function as x > O 
is Zero. 


(0, 1) 


(b) This exampleillustrates a function whose limit does exist, but whose limiting value 
does not equal its value at the limiting point. Define f: R —> R by 


0 ifx 40 
Fon) = {4 ifx =0. 


It is true that limits» f(x) = 0, since for any neighborhood U of 0,x e U and 
x Æ Oimplies that f (x) = 0. We see from the graph in Figure 2.2.10 that f approaches 
0 as x — 0; we do not care that f happens to take on some other value at0. A 


Use the definition to verify that the “obvious” limit xx, X = Xo holds, where x and 
Xo € R’. 


Let f be the function defined by f(x) = x, and let N be any neighborhood of Xo. We 
must show that f (x) is eventually in N asx — Xo. According to the definition, we must 
find a neighborhood U of Xo with the property that if x + Xo and x €e U , then f(x) €e N. 
PickU =N.Ifx eU, thenx e N; because x = f(x), it follows that f(x) € N. Thus, 
we have shown that limity.x, X = Xo. In a similar way, we have 


limit X = Xo, etc. A 
(x, y)—>(Xo, Yo) 


In what follows, you may assume, without proof, the validity of limits from one- 


variable calculus. For example, limit,.3./X = /1 = 1 and limits_.o sin 6 = sin 0 = 0 
may be used. 
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example 5 


solution 


(This example demonstrates another case in which the limit cannot simply be “read off” 
from the function.) Find limit,.19(x) where 


x—1 


vVX-1 
This function is graphed in Figure 2.2.11(a). 


g: Xe 


exe [k+l 


(b) 


figure 2.2.11 These graphs are the same except that in part (a), g is undefined at x = 1, 
whereas in part (b), g* is defined for all x > 0. 


We see that g(1) is not defined, because division by zero is not defined. H owever, if 
we multiply the numerator and denominator of g(x) by ./x + 1, we find that for all x 
in the domain of g we have 

x-1 


g= le x £1. 


The expression g*(x) = ./X + 1 is defined and takes the value 2 at x = 1; from one- 
variable calculus, g*(x) —> 2asx — 1.Butbecauseg*(x) = g(x) forallx >0,x 41, 
we must have as well that g(x) = 2asx > 1. A 


We will consider other examples in two variables shortly. 


Properties of Limits 


To properly speak of the limit, we should establish that f can have at most one limit 
as X —> Xo. This is intuitively clear and we now state it formally. (See the Internet 
supplement for the proof.) 


Theorem 2 Uniqueness of Limits 


If limit f (x) = bi and limit f(x) = bp, then b; = ho. 


To carry out practical computations with limits, we require some rules for limits; for 
example, that the limit of a sum is the sum of the limits. These rules are summarized in 
the following theorem (see the Internet supplement for Chapter 2 for the proof). 
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Theorem 3 Properties of Limits Let f: A c R" > R", g: ACR" > R", 
Xo bein A or be a boundary point of A, b € R™, and c € R; then 


(i) If limity.,, f(x) = b, then limity.,,cf(x) = cb, where cf: A > R” is 
defined by x + c( f(x)). 
(ii) If limity.x, f (x) = bı and limit,.,,9(x) = bz, then limitx.,, ( f + 9)(x) = 
bı + bo, where ( f +g): A > R™ is defined by x —> f(x) + g(x). 
(iii) If m = 1, limity.,, f(x) = bi, and limity.,,g(x) = be, then 
limity. x. ( fg)(x) = bıb2, where ( fg): A —> Risdefined by x > f(x)g(x). 


(iv) If m = 1, limitk., f(x) = b 4 0, and f(x) 4 0 for all x € A, then 
limity.x)1/f (x) = 1/b, where 1/f: A — Ris defined by x +> 1/f (x). 


(v) If f(x) = ( fi(x),..., fm(x)), where fi: A > R,i = 1,...,m, are the 
component functions of f, then limity.,, f(x) = b = (bi, ..., Dm) if and 
only if limity.x, ES = b; foreachi =1,...,m. 


These results ought to be intuitively clear. For instance, rule (ii) says that if 
f (x) is close to bı and g(x) is close to b) when x is close to Xp, then f(x) + g(x) 
is close to bı + bz when x is close to Xo. The following example illustrates how this 
works. 


Let f: R? > R, (x, y) œ> x? + y? + 2. Compute the limit 


limit f(x, y). 
(x,y) > (0,1) (x,y) 


Here f is the sum of the three functions (x, y) > x2, (x, y) => y?, and (x, y) > 2. 
The limit of a sum is the sum of the limits, and the limit of a product is the product of 
the limits (Theorem 3). Hence, using the fact that limit(,, y)+(xo, yo)X = Xo (Example 4), 
we obtain 


limit x?= (| limitx limitx } =x 
(x,y) > (Xo Yo) (Gy) (%orYo) / (OGY) (Xo, Yo) 
and, using the same reasoning, limiti, y) (xo,ya) Y? = Yo. Consequently, 


fat — n2 2 < 
limit f(x% y)=0 +1°+2=3. 4 


(x,y)—(0, 


Continuous Functions 


In single-variable calculus we learned that the idea of a continuous function is based on 
the intuitive notion of a function whose graph is an unbroken curve; that is, a curve that 
has no jumps, or the kind of curve that would be traced by a particle in motion or by a 
moving pencil point that is not lifted from the paper. 
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figure 2.2.13 (a) Discontinuous 
function for which limit x F(X) 
does not exist. 

(b) Continuous function for 
which this limit exists and equals 
F(X). 


g: Xe x2 


(a) (b) 


figure 2.2.12 The function f in part (a) is not continuous, because its value jumps as x crosses 0, 
whereas the function g in part (b) is continuous. 


To perform a detailed analysis of functions, we need concepts more precise than this 
rather vague notion. An example may clarify these ideas. Consider the specific function 
f: R — R defined by f(x) = —1 if x < Oand f(x) = 1 if x > 0. The graph of f is 
shown in Figure 2.2.12(a). [The little open circle denotes the fact that the point (0, 1) 
does not lie on the graph of f.] Clearly, the graph of f is broken at x = 0. Consider 
also the function g: x +» x2. This function is pictured in Figure 2.2.12(b). The graph 
of g is not broken at any point. 

If we examine examples of functions like f , whose graphs are broken at some point 
Xo, and functions like g, whose graphs are not broken, we see that the principal difference 
between them is that for a function like g, the values of g(x) get closer to g(Xo) as x 
gets closer and closer to Xo. The same idea works for functions of several variables. But 
the notion of closer and closer does not suffice as a mathematical definition; thus, we 
shall formulate these concepts precisely in terms of limits. 

Because the condition limitx>x f(x) = f (Xo) means that f(x) is close to f (Xp) 
when x is close to Xo, we see that this limit condition does indeed correspond to the 
requirement that the graph of f be unbroken (see Figure 2.2.13, where we illustrate the 
case f: R — R). The case of several variables is easiest to visualize if we deal with 
real-valued functions, say f : IR? —> R. In this case, we can visualize f by drawing its 
graph, which consists of all points (x, y, z) in R? with z = f(x, y). The continuity of 
f thus means that its graph has no “breaks” in it (see Figure 2.2.14). 
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where f is discontinuous 


(a) (b) 


figure 2.2.14 (a) A discontinuous function of two variables. (0) A continuous function. 


Definition Continuity Let f: A c R" —> R" be a given function with 
domain A. Let Xp € A. We say f is continuous at xp if and only if 


limit f (x) = f (Xo). 
X— X0 
If we just say that f is continuous, we shall mean that f is continuous at each 


point Xo of A. If f is not continuous at Xo, we say f is discontinuous at Xo. If f 
is discontinuous at some point in its domain, we say f is discontinuous. 


example 7 Any polynomial p(x) = aj +a1X +---+a,X" is continuous from R to R. Indeed, from 
Theorem 3 and Example 4, 


limit (ao + 1X +--+ + yx") = limit ao + limit ayx +- -- + limit anx" 
X—> X0 X—> Xo X—> X0 ñ X—> Xo 
= āo + 41X9 +: ++ + anXo, 


because the limit of a product is the product of the limits, which gives 


n 
limit x" = (limit x) =x A 


X>X0 X— X0 


example 8 | Let f: R? > R, f(x, y) = xy. Then f is continuous, because, by the limit theorems 
and Example 4, 


limit xy = limit x limit y) =Xoyo. a 
(x,y)—> (xo, Yo) (x,y)—> (xo, Yo) (x,y) (Xo, yo) 


We can see by the same method that any polynomial p(x, y) [for example, p(x, y) = 
3x? — 6xy? + y?] in x and y is continuous. 
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example 9 


example 10 


solution 


The function f : R? — R defined by 


1 ifx <Oory <0 
l= i otherwise 


is not continuous at (0, 0) or at any point on the positive x axis or positive y axis. Indeed, 
if (Xo, Yo) = wis sucha point (i.e., Xo = Oand yo > 0, or yo = O and Xp > 0) ands > 0, 
there are points (x, y) € D3(u), aneighborhood of u, with f(x, y) = 1 and other points 
(x, y) e Ds(u) with f(x, y) = 0. Thus, itis not true that f(x, y) > f (Xo, Yo) = las 
(x, y) = (Xo, Yo). A 


To prove that specific functions are continuous, we can avail ourselves of the limit 
theorems (see Theorem 3 and Example 7). If we transcribe those results in terms of 
continuity, we are led to the following: 


Theorem 4 Properties of Continuous Functions Suppose that f: A c 
R” > R”, g: A CR" — R”, and let c bea real number. 


(i) If f is continuous at Xo, so is cf, where (cf )(x) = cI f (x)]. 


(ii) If f and g are continuous at Xo, so is f + g, where the sum of f and g is 
defined by ( f + g)(x) = f(x) + g(x). 

(iii) If f and g are continuous at Xọ and m = 1, then the product function fg 
defined by ( fg)(x) = f(x)g(x) is continuous at Xo. 


(iv) If f: A c R" — R is continuous at xo and nowhere zero on A, then the 
quotient 1/f is continuous at Xo, where (1/f )(x) = 1/f (x). 


(v) If f: A cR — Rand f(x) = ( fi(x), ..., fm(x)), then f is continuous 
at Xo if and only if each of the real-valued functions fı, ..., fm is continuous 
at Xo. 


A variant of (iv) is often used: If f(x) 4 0 and f is continuous, then f(x) 4 Oina 
neighborhood of Xo and so 1/f is defined in that neighborhood, and 1/f is continuous 
at Xo. 


Let f: R? > R?, (x, y) > (x?y, (y + x3)/(1 + x?)). Show that f is continuous. 


To see this, itis sufficient, by property (v) of Theorem 4, to show that each component 
is continuous. As we have mentioned, any polynomial in two variables is continuous; 
thus, the map (x, y) +> x?y is continuous. Because 1 + x? is continuous and nonzero, 
by property (iv), we know that 1/(1 + x?) is continuous; hence, ( y + x) /(1+ x?) isa 
product of continuous functions, and by (iii) is continuous. A 


Similar reasoning applies to examples like the function c: IR —> R? given by e(t) = 
(t?, 1, t3/(1 + t?)) to show they are continuous as well. 


figure 2.2.15 The composition 


of fong. 


example 11 


solution 
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Composition 


Next we discuss composition, another basic operation that can be performed on func- 
tions. If g maps A to B and f maps B to C , the composition of g with f, or of f on 
g, denoted by f og, maps A to C by sending x +> f(g(x)) (see Figure 2.2.15). For 
example, sin (x?) is the composition of x + x? with y > sin y. 


Theorem 5 Continuity of Compositions Let g: A c R" —> R" and let 
f: B c R — RP. Suppose g(A) c B, so that f o g is defined on A. If g is 
continuous at Xọ € A and f is continuous at Yọ = g (Xo), then f o g is continuous 
at Xo. 


The intuition behind this is easy; the formal proof in the Internet supplement follows 
a similar pattern. Intuitively, we must show that as x gets close to Xo, f (g(x)) gets close 
to f (g(Xo)). But as x gets close to Xo, g(x) gets close to g (Xo) (by continuity of g at Xo); 
and as g(x) gets close to g(Xo), f(g(x)) gets close to f (g(Xo)) [by continuity of f at 
Q(Xo)]. 


Let f(x, y, Z) = (x? + y? + z?) + sin z?. Show that f is continuous. 


Herewecanwrite f asasum of thetwo functions (x?+y?+z?) and sin z?, soit suffices 
to show that each is continuous. The first is the composite of (x, y, z) œ> (x? + y? +z?) 
with u +> u*°, and the second is the composite of (x, y, z) œ> z? with u + sin u, and 
so we have continuity by Theorem5. A 


Limits in Terms of e's and 4's 


We now state a theorem (proved in the Internet supplement for Chapter 2) giving 
a useful formulation of the notion of limit in terms of epsilons and deltas that is 
often taken as the definition of limit. This is, in fact, another way of making precise the 
intuitive statement that “ f (x) is close to b when x is close to Xo.” To help understand this 
formulation, the reader should consider it with respect to each of the examples already 
presented. 


Theorem 6 Let f: A c R" > R" and let Xo bein A or bea boundary point of 
A. Then limit» f (x) = b if and only if for every number £ > 0 thereisasd > 0 
such that for any x €e A Satisfying 0 < ||x — Xol| < 6, we have || f(x) — b|| < £ 
(see Figure 2.2.16). 
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figure 2.2.16 The geometry of 
the e-6 definition of limit. 


example 12 


solution 


example 13 


solution 


Image of Ds (XQ) 


To illustrate the methodology of the epsilon-delta technique in Theorem 6, we con- 
sider the following examples. 


Show that limit(x,y)-.(0,0)X = 0 using the e-d method. 


Note that if 5 > 0, |I(x, y) — (0, 0) || = \/x2 + Y? < ô implies |x — 0| = |x| = Vx? < 
yX? + y? < ô. Thus, if I(x, y) — (0, 0)|| < 6, then |x — O| is also less than 5. Given 
e > 0, weare required to find a 8 > 0 (generally depending on €) with the property that 
0 < |I\(x, y) — (0, O)|] < 5 implies |x — 0| < £. What are we to pick as our 6? From 
the preceding calculation, we see that if we choose 5 = e, then ||(x, y) — (0, 0)|| < 6 
implies |x — 0| < £. This shows that limit(x, y).(0,0)X =0. Given e > 0, we could have 
also chosen ô = ¢/2 or £/3, but it suffices to find just one 6 satisfying the requirements 
of the definition of a limit. A 


Consider the function 


sin (x? + y?) 
OS aye 


Even though f is not defined at (0, 0), determine whether f(x, y) approaches some 
number as (x, y) approaches (0, 0). 


From one-variable calculus or L'H 6pital’s rule we know that 


.,, sing 
limit — = 1. 
a>0 a 


Thus, it is reasonable to guess that 


oe __,, sin |v]? 
limit f(v) = limit STINE 
v—> (0,0) v>(0,0) |v]? 
Indeed, because limit,_.9 (sin a) /a@ = 1, given £ > 0 we are able to find aé > 0, with 
0 <6 <1, such that 0 < |a| < 5 implies that |( sin w)/a@ — 1| < e. If 0 < |ivi| < ô, 
then 0 < ||v||2 < 6? < 4, and therefore 


f(v) -1| = 


sin |v]? | 
Iivi]? 


example 14 


solution 


example 15 
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figure 2.2.17 Graph of the function f(x, y) = (sin (x? + y?))/(x? + y?). 


Thus, limity_. (0,0) f (v) = 1. If we plot [sin (x? + y?)]/(x? + y?) on a computer, we get 
a graph that is indeed well behaved near (0, 0) (Figure 2.2.17). A 


Show that 


2 
limit ——— = 0. 
(x,y)+(0,0) ,/x2 + y2 


We must show that x?/./x2 + y? is small when (x, y) is close to the origin. To do this, 
we use the following inequality: 


x2 xX? + y? 
0< < because y? > 0 
Smr aye | oa 
= /x2 + y2, 


Given e > 0, choose 6 = e. Then ||(x, y) — (0, 0) || = W(x, y) || = /x2 + y?, and so 
I(x, y) — (0, 0) || < 8 implies that 


x? x? 
0| = < \/x24+ Y? = |\(x, y) — (0,0 b =e. 
arse | Vety? = +y I(x, vy) — ( Il < E 


Thus, the conditions of Theorem 6 have been fulfilled and the limitis verified. A 


(a) Does 
Taai 2 2 2 
PTa EES 
exist? [See Figure 2.2.18(a).] 
(b) Prove that [see Figure 2.2.18(b)] 


2 
imt ao 


(x,y) > (0,0) X? + y? 
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As (x, y) approaches 2 z 
(0, 0) along this 


ridge, Z—>1 
WY 
s 
w 
DB x NV 


SS. 
’ SOS Sx So 
SYS ee 


Nin 


As (x, y) approaches 
(0, 0) in this 
(a) valley, z > 0 (b) 


figure 2.2.18 (a) The function z = x?/(x* + y?) has no limit at (0, 0). (b) The function 
z= (2x*y)/(x? + y?) has limit O at (O, 0). 


solution | (a) If thelimit exists, x?/(x? + y?) should approach a definite value, say a, as (x, y) 
gets near (0, 0). In particular, if (x, y) approaches zero along any given path, then 
x*/(x? + y?) should approach the limiting value a. If (x, y) approaches (0, 0) along the 
line y = 0, the limiting value is clearly 1 (just set y = 0 in the preceding expression to 
get x?/x? = 1). If (x, y) approaches (0, 0) along the line x = 0, the limiting value is 


2 


Hence, limiti, y)-.(0,0)X°/(x? + y?) does not exist. 
(b) Note that 


Thus, given « > 0, choose 5 = «/2; then 0 < |\(x, y) — (0, O)| = \/x2+y2 < 6 
implies |y| < ô, and thus 


Using the -ô notation, we are led to the following reformulation of the definition of 
continuity. 


Theorem 7 Let f: A c R" —> R" be given. Then f is continuous at x9 € A 
if and only if for every number £ > 0 there is a number 5 > 0 such that 


xeA and |X — Xoll < ô implies \| f (x) — f (x9) || < €. 


The proof is almost immediate. Notice that in Theorem 6 we insisted that 0 < 
IIx — Xoll; that is, x A Xo. That is not imposed here; indeed, the conclusion of Theorem 7 
is certainly valid when x = Xp, and so there is no need to exclude this case. H ere we do 
care about the value of f atx»; we want f at nearby points to be close to this value. 
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In the following exercises you may assume that the exponential, sine, and cosine functions are continuous and may freely use 


techniques from one-variable calculus, such as L'H ôpital’s rule. 


1. 


Let f: R? —> R and suppose that 


lim. _ f(x, y) = 5. What can you say about the 


(x,y)—>(1,3) 
value f (1,3)? 


. Let f: R? — R is continuous and suppose that 
limx,y) (1,3) f(x, y) = 5. What can you say about the 


value f (1, 3)? 


» Compute the limits: 


(a) limit and 
(x,y) — (0, 


(b) MoS 


eh _ 
2 Ye 


« Compute the following limits: 


(a) limit @% 
(x,y) — (0,1) y 


n2 
E xX 
(b) mit 0 * 


2 

|. SIN X 
c) limit 
(c) limit — > 


» Compute the following limits: 


(a) limit(x? — 3x +5) 
Xx> 
(b) limitsin x 
x>0 
2 _ y2 
Cima 
h>0 
Let 


f(x,y) = { he if (x,y) 4 (0,0) 
0 if (x, y) = (0, 0). 


(a) Compute the limit as (x, y) > 


path x = 0. 
(b) Compute the limit as (x, y) > 
path x = y3. 


(c) Show that f is not continuous at (0, 0). 


eX+y 
» Let f(x, y, z) = 5 


limpo £(1,24h,3)~-#(£2,3) 


(0, 0) of f along the 
(0, 0) of f along the 


13. 


8. Compute the following limits if they exist: 


E TEE 

(a) limit (x + y)*—(x—y) 
(x,y) — (0,0) xy 

(b) limit 

(x,y) (0,0) y 


Ç limit 
(c) (x,y) — (0,0) rar 


9. Compute the following limits if they exist: 
. @ey—1 
imit ——— 
(x,y) +(0,0) y 
or cos(xy) —1 
b limit § ——., —— 
Oio  x2y? 


xy 
3 limit  —=— 
aa (0,0) x? + y2 +2 


10. Compute the following limits if they exist: 


xy 


a limit 
(a) (x,y) > (0,0) X +1 


<. CoSX — 1 —(x2/2) 
b limit, §$£——__, —_,—— 
m (x,y) > (0,0) x4 + y4 


oa eyy 
Cc limit 
(c) (x,y) = (0,0) X? + y? 


11. Compute the following limits if they exist: 


aan inx 
(a) limit ny 
(x,y) (0,0) XY 
sin (xyz) 
(b) y 
(x,y,z) > (0,0,0) XyZ 


(c) limit 
(x,y,z) — (0,0,0) 


(x? +3y?)/(x +1) 


12. Compute the following limits if they exist: 


sin2x — 2x 
limit ———.—— 
(a) x>0 x3 
o, in2x — 2x 
(b) limit ee 
(x,y) = (0,0) X? +y 
2x?y cosz 


C mi 2 
(c) (x,y,z) > (0,0,0) X2 + y2 


cases: 


(a) f: ROR, XP |x|, x9=1 
(b) f: R” > R, xt |x|], arbitrary xo 


f(x, y,z), where f(x, y, z) = 


Compute limity_,x, f (x), if it exists, for the following 
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14, 


15. 
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(c) f:R > R2,x m (x2, e*), x9 = 1 


(d) f: R2\{(0, 0)} > R2, (x, y) e (sin(x — y), 
ex(¥+) _ x — 1)/\I(x, y)|], Xo = (0, 0) 


Let f(x, y,z) = Way Describe geometrically 
the set in IR? where f fails to be continuous. 


Where is the function f(x, y) = wy continuous? 


Show that the subsets of the plane in Exercises 18-21 are open: 


18. 
19. 
20. 
21. 
22. 


23. 


24. 


25. 


26. 


A={(x,y)|-l<x<1,-l<y<J 

B = {(x, y) | y > 0} 

C= {(x,y) 12 < x? +y? <4} 

D = {(x, y) |x # 0 and y 4 0} 

Let A c R? be the open unit disk D1(0, 0) with the 


point Xo = (1, 0) added, and let f: A > R, xr f(x) 
be the constant function f (x) = 1. Show that 
limitx—xo f (x) = 1; 


If f: R” — Rand g: R” — R are continuous, show 
that the functions 


f2g: R” + R, x e [f (x)]?g(x) 
and 
f2? +g: R > R, x I [f (x)? + g(x) 
are continuous. 


(a) Show that f: R —> R, x > (1— x)? +cos(1+x3) 
is continuous. 

(b) Show that the map 
f: R —> R, x 6 xĉ?e* /(2 — sin x) is continuous. 


(a) Can [sin (x + y)]/(x + y) be made continuous by 
suitably defining it at (0, 0)? 

(b) Can xy/(x2 + y?) be made continuous by suitably 
defining itat (0, 0)? 

(c) Prove that 
f: R? > R, (x, y) ye + sinx + (xy)4 is 
continuous. 


Using either e's and ô's or spherical coordinates, show 
that 
aa xyz 
limit — a =0. 
(x,y,z) > (0,0,0) X*+y*+Z 


16. 


17. 


27. 


28. 


30. 


31. 


32. 


33. 


(a) Considering A:R? —> R? as a linear map, explicitly 
write the component functions of A. 


(b) Show that A is continuous on all of R2. 


Find lim (3x? + 3y2) log(x? + y?). (HINT: Use 
(x, y) (0,0) 


polar coordinates.) 


Use the e-5 formulation of limits to prove that x? — 4 
asx — 2. Give another proof using Theorem 3. 


(a) Prove that for x e R” ands < t, Ds(x) c D(x). 
(b) Prove that if U and V are neighborhoods of x € R", 
then so areU AV andU UV. 


(c) Prove that the boundary points of an open interval 
(a,b) c R are the points a and b. 


. Suppose x and y are in R” and x Æ y. Show that there 


is a continuous function f: R” — R with f(x) = 1, 
f(y) = 0, and 0 < f(z) < 1 for every zin R". 


Let f: A c R” > R be given and let xọ be a boundary 
point of A. We say that limitx—x, f (x) = œœ if for every 
N > 0 thereisaé > 0 such that 0 < ||x — xol] < 6 and 
x e A implies f(x) > N. 


(a) Prove that limity_.1 (x — 1)7? = oo. 
(b) Prove that limity_,91/|x| = oo. Is it true that 
limity_,91/x = 00? 


(c) Prove that limit, y) (0,0) 1/(x2 + y?) = œœ. 


Letb e Rand f: R\[b] —> R bea function. We write 
limity_sp— f(x) = L and say that L is the left-hand 
limit of f atb if for every « > 0 there is a 8 > 0 such 
that x < b and 0 < |x — b| < dimplies|f(x) —L| < e. 


(a) Formulate a definition of right-hand limit, or 
limitkb+ f(x). 


(b) Find limitx„o—1/(1 + e2/*) and 
limity.941/(1 + e!/*). 


(c) Sketch the graph of 1/(1 + e!/*), 


Show that f is continuous at xo if and only if 
limit || f(x) — f (xo) || = 0. 
X— Xo 


Let f: A c R” — R" satisfy 
| f(x) — f(y) |] < K [|x — y|” for all x and yin A for 
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positive constants K and a. Show that f is continuous. 35. (a) Find a specific number 5 > 0 such that if Ja] < 6, 
(Such functions are called H older-continuous or, if then |a3 + 3a2 +. a| < 1/100. 
a = 1, Lipschitz-continuous.) (b) Find a specific number 5 > 0 such that if 


x2 + y? < 84, then 


34, Show that f: IR" — R" is continuous at all points if 


and only if the inverse image of every open set is open. 
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Ix? + y? + 3xy + 180xy5] < 1/10,000. 


In Section 2.1 we considered a few methods for graphing functions. By these meth- 
ods alone it may be impossible to compute enough information to grasp even the 
general features of a complicated function. From elementary calculus, we know that 
the idea of the derivative can greatly aid us in this task; for example, it enables us 
to locate maxima and minima and to compute rates of change. The derivative also 
has many applications beyond this, as you have surely has discovered in elementary 
calculus. 

Intuitively, we know from our work in Section 2.2 that a continuous function is one 
that has no “breaks” in its graph. A differentiable function from R? to R ought to be 
such that not only are there no breaks in its graph, but there is a well-defined plane 
tangent to the graph at each point. Thus, there must not be any sharp folds, corners, or 
peaks in the graph (see Figure 2.3.1). In other words, the graph must be smooth. 


Partial Derivatives 


To make these ideas precise, we need a sound definition of what we mean by the phrase 
“f(X1,...,Xn) is differentiable atx = (Xj, ..., Xn).” Actually, this definition is not quite 
as simple as one might think. Toward this end, however, let us introduce the notion of the 
partial derivative. This notion relies only on our knowledge of one-variable calculus. 
(A quick review of the definition of the derivative in a one-variable calculus text might 
be advisable at this point.) 


Z= f(x, y) Peak 


Corner 


Fold 


Pit 


(a) (b) 


figure 2.3.1 (a) A smooth graph and (b) a nonsmooth one. 
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Definition Partial Derivatives Let U c R" be an open set and suppose 
f: U CR" > Risareal-valued function. Then df /dX1,..., Of /0Xn, the partial 


derivatives of f with respect to the first, second, ... , nth variable, are the real- 
valued functions of n variables, which, at the point (X1, ..., Xn) = X, are defined 
by 

es) _ jj (Xt, Xa aaen Xp EM. Xn} = FX. Xn) 

Ox h>0 


lim £O¢+ hey) — f(x) 
h=0 h 


if the limits exist, where 1 < j < n and ej is the jth standard basis vector defined 
by ej = (0,...,1,..., 0), with 1in the jth slot (see Section 1.5). The domain of 
the function df /dx; is the set of x € R” for which the limit exists. 


In other words, df /dx; is just the derivative of f with respect to the variable xj, 
with the other variables held fixed. If f: R? — R, we shall often use the notation 
af /ax, af /ay, af /az in place of df /dx1, Of /AX2, af /dx3. If f: U C R" —> R”, then 
we can write 


P(X sor Xn) = (f1(%1, -.-) Xnde ee ey fm(Xa, -- Xn), 
so that we can speak of the partial derivatives of each component; for example, 3fm/3Xn 


is the partial derivative of the mth component with respect to x, the nth variable. 


example | If f(x, y) = xy + y?, find af /ax and af /ay. 


solution To find af /ax we hold y constant (think of it as some number, say 1) and differentiate 
only with respect to x; this yields 


af = a(x?y +y?) 


Similarly, to find af /ay we hold x constant and differentiate only with respect to y: 


af a(x*y+y?) o , 
a ay = Xf + 3y^. m 


To indicate thata partial derivative isto be evaluated at a particular point, for example, 
at (Xo, Yo), we write 


i ) or d A 
ax o Yo ax ax | 


X=X0, Y=Vo Xo, Yo) 

When we write z = f(x, y) for the dependent variable, we sometimes write 0z/ax for 
af /dx. Strictly speaking, this is an abuse of notation, but it is common practice to use 
these two notations interchangeably. 
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If z = cosxy +x cosy = f(x, y), find the two partial derivatives (3z/3x) (Xo, Yo) and 
(9Z/dy)(Xo, Yo). 


First we fix yo and differentiate with respect to x, giving 


az A(COS XYo + X COS Yo) 
—(Xo, Yo) = Yo L 
OX OX TEA 

= (—Yo SİN XYo + COS Yo) |x=xo 


= —Yo SİN XoYo + COS Yo. 


Similarly, we fix Xo and differentiate with respect to y to obtain 


az (COS Xoy + Xo COS 
Gt giles ( oy + Xo COs Y) 
dy dy y=Yo 


= (—Xo SIN Xoy — Xo SİN Y) |y=yo 
= —Xo SİN XoYo — Xo SİN Yo. A 


Find af /ox if f(x, y) = xy/,/x2 + y?. 
By the quotient rule, 


BF x? + y? NEET _ x? rey y? 


aX x2 + y2 ~ Ry T KRF A 


A definition of differentiability that requires only the existence of partial derivatives 
turns out to be insufficient. M any standard results, such as the chain rule for functions 
of several variables, would not follow, as Example 4 shows. B elow, we shall see how to 
rectify this situation. 


Let f(x, y) = x'/3y/3, By definition, 


af _., f(h,0)— (0,0) |, .. 0-0 
xO 


and, similarly, (af /ay)(0, 0) = 0 (these are not indeterminate forms!). It is necessary 
to use the original definition of partial derivatives, because the functions x!/3 and y1/3 
are not themselves differentiable at 0. Suppose we restrict f to the line y =x to get 
f (x, x) = x?” (see Figure 2.3.2). Wecan view the substitution y = x asthecomposition 
f og of the function g: R —> R?, defined by g(x) = (x, x), and f: R? — R, defined 
by f(x, y) =x4y¥3, 

Thus, the composite f o g is given by (f o g)(x) = x*/3. Each component of g is 
differentiable in x, and f has partial derivatives at (0, 0), but f og is not differentiable 
at x = 0, in the sense of one-variable calculus. In other words, the composition of 
f with g is not differentiable in contrast to the calculus of functions of one variable, 
where the composition of differentiable functions is differentiable. Later, we shall give 
a definition of differentiability that has the pleasant consequence that the composition 
of differentiable functions is differentiable. 

There is another reason for being dissatisfied with the mere existence of partial 
derivatives of f(x, y) = x!/3y/3: Thereis no plane tangent, in any reasonable sense, to 
the graph at (0, 0). The xy plane is tangent to the graph along the x and y axes because 
f has slope zero at (0, 0) along these axes; that is, af /ax = 0 and af /ay = 0 at (0, 0). 
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figure 2.3.2 The portion of the graph of 
x1/3 1/8 in the first quadrant. 


Thus, if there is a tangent plane, it must be the xy plane. However, as is evident from 
Figure 2.3.2, the xy plane is not tangent to the graph in other directions, because the 
graph has a severe crinkle, and so the xy plane cannot be said to be tangent to the graph 
of f. A 


The Linear or Affine Approximation 


To “motivate” our definition of differentiability, let us compute what the equation of the 
plane tangent to the graph of f: R? > R, (x, y) => f(x, y) at (Xo, Yo) ought to be if 
f is smooth enough. In R3, anonvertical plane has an equation of the form 


Z=ax+by +c. 


If it is to be the plane tangent to the graph of f, the slopes along the x and y axes 
must be equal to af /ax and af /ay, the rates of change of f with respect to x and y. 
Thus, a = of /ax, b = af /dy [evaluated at (Xo, Yo) ]. Finally, we may determine the 
constant c from the fact that z = f(Xo, Yo) when x = Xo, y = Yo. Thus, we get the 


‘linear approximation (or, more accurately said, affine approximation): 


of of 
z= Fv yo) + [oo yo) (x —H0) + [FE O80, vofiy- 
which should be the equation of the plane tangent to the graph of f at (Xo, Yo), if f is 
“smooth enough” (see Figure 2.3.3). 
Our definition of differentiability will mean in effect that the plane defined by the 
linear approximation (1) is a “good” approximation of f near (Xo, Yo). To get an idea of 


Z 


Tangent plane 
of graph f at 


(Xo, Yo, F(Xo, Yo) 


i, figure 2.3.3 For points (x, y) near (%, yo). 


the graph of the tangent plane is close to 


~ (0, Yo, F(X0, Yo)) the graph of f. 


(Xo, Yo) 
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what we might mean by a good approximation, let us return for a moment to one-variable 
calculus. If f is differentiable at a point xo, then we know that 


f (Xo + Ax) — f (Xo) 


ae 
Let x = Xo + Ax and rewrite this as 
imit LEX) = FXO) _ ergy, 


X—>X0 X — Xo 


Using the trivial limit limit,_.,, f’(Xo) = f’(Xo), we can rewrite the preceding equation 
as 


limit FOX) = F0%0) = limit f’(X9); 
X>Xo X — Xo X—> Xo 
that is, 
limit [122 = 10) rs] wo. 
X—> Xo X — Xo 
that is, 
limit f (x) — f (Xo) — f’(X0)(X — Xo) ai 
X— Xo X — Xg 


Thus, the tangent line! through (Xo, f (Xo)) with slope f’(X9) is close to f in the sense 
that the difference between f(x) and I(x) = f(Xo) + f’(Xo)(x — Xo), the equation of 
the tangent line goes to zero even when divided by x — Xo as x goes to Xo. This is the 
notion of a “good approximation” that we will adapt to functions of several variables, 
with the tangent line replaced by the tangent plane [see equation (1), given earlier]. 


Using the linear approximation, we are ready to define the notion of differentiability. 


Definition Differentiable: Two Variables Let f: R? — R. We say f is 
differentiable at (xo, yo), if af /ax and af /ðy exist at (Xo, Yo) and if 


F(x, y) — f (xo, Yo) — Eto yo] ~ x9) — Fo, yo] (y = 
> 0 


I(x, y) — (Xo, Yo) Il (2) 


as (x, y) —> (Xo, Yo). This equation expresses what we mean by saying that 
of of 
f (Xo, Yo) + [oo ya) (x — Xo) + ES yo) (y — Yo) 


is a good approximation to the function f. 


It is not always easy to use this definition to see whether f is differentiable, but it 
will be easy to use another criterion, given shortly in Theorem 9. 
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Tangent Plane 


We have used the informal notion of the plane tangent to the graph of a function to 
motivate our definition of differentiability. Now weareready to adoptaformal definition 
of the tangent plane. 


Definition Tangent Plane Let f: R? —> Rbedifferentiableat x = (Xo, Yo). 
The plane in R? defined by the equation 


2 = F(X) Yo) + [Zi yd] ey [Zo JE a 


is called the tangent plane of the graph of f atthe point (Xo, Yo, f (Xo, Yo)). 


Compute the plane tangent to the graph of z = x? + y* + e% at the point (1, 0, 2). 


Use formula (1), with Xo = 1, yo = 0, and Zo = f (Xo, Yo) =2. The partial derivatives 
are 


dz OZ 
— =2x + ye” and —— = dy? + xe”, 
OX TI oy oe 


At(1, 0, 2), these partial derivatives are 2 and 1, respectively. Thus, by formula (1), the 
tangent plane is 


z=2(x —1)+1(y —0) +2, that is, z=2xX+y. A 


Let us write Df (Xo, Yo) for the row matrix 


EZ yo) inn w|, 


so that the definition of differentiability asserts that 


f (Xo, Yo) + Df (Xo, Yo) p a A 
= F (Xo. Yo) + Eg y) (x — xa) + Es ya) (y-yo) 3 


is our good approximation to f near (Xo, Yo). As earlier, “good” is taken in the sense that 
expression (3) differs from f(x, y) by something small times \/(x — xo)? + (Y — Yo)?. 
We say that expression (3) is the best linear approximation to f near (Xo, Yo). 


Differentiability: The General Case 


Now we are ready to give a definition of differentiability for maps f of R” to R™, using 
the preceding discussion as motivation. The derivative Df (Xo) of f = ( fa, ..., fm) at 
a point Xo is a matrix T whose elements are tij = ðf; /x; evaluated at Xo.’ 


“It turns out that we need to postulate the existence of only some matrix giving the best linear approxi- 
mation near Xp € R", because in fact this matrix is necessarily the matrix whose i jth entry is əfi /ax; 
(see the Internet supplement for Chapter 2). 
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Definition Differentiable, n Variables, m Functions Let U be an open 
set in R” and let f: U c R" —> R" be a given function. We say that f is 
differentiable at x9 € U if the partial derivatives of f exist at Xo and if 


xox Ix- xo] 


where T = Df (xo) is the m x n matrix with matrix elements əfi /dx; evaluated 
at Xo and T(x — Xo) means the product of T with x — Xo (regarded as a column 
matrix). We call T the derivative of f at Xo. 


We shall always denote the derivative T of f at xo by D f (Xo), although in some 
books it is denoted df (Xo) and referred to as the differential of f. In the case where 
m = 1, the matrix T is just the row matrix 


Zo e wl, 


ax, OXn 


(Sometimes, when there is danger of confusion, we separate the entries by commas.) 
Furthermore, setting n = 2 and putting the result back into equation (4), we see that 
conditions (2) and (4) do agree. Thus, if we let h = x — Xp, a real-valued function f of 
n variables is differentiable at a point Xo if 


eon l T af 
limit a f00+ 8) f (Xo) D4 3x; "Ih = 0) 
because 
n 
Lue N (Xo) 


For the general case of f mapping a subset of R” to R™, the derivative is them x n 
matrix given by 


afi afi 
Df (x)= | : > Je 
afm afm 


where df; /dx; is evaluated at Xo. The matrix Df (Xo) is, appropriately, called the matrix 
of partial derivatives of f at xp. 


Calculate the matrices of partial derivatives for these functions. 
(a) f(x, y) = (e+ + y, y?x) 

(b) f(x,y) = (x? + cosy, ye*) 

(c) f(x, y, z) = (ze*, —ye’) 
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(a) Here f : R? — R? is defined by f(x, y) = e*t! + y and f,(x, y) = y?x. Hence, 
Df (x, y) isthe 2 x 2 matrix 


Df (x,y) = ki exty 4p i 


y?  2xy 
(b) We have 


(c) In this case, 


Gradients 
For real-valued functions we use special terminology for the derivative. 


Definition Gradient Consider thespecial case f: U c R" —> R. Here Df (x) 
isal x n matrix: 


Df tx) = | are 1 
0X1 OXn 
We can form the corresponding vector (df /0X1,..., Əf /AX,), called the gradient 


of f and denoted by Vf, or grad f. 


From the definition, we see that for f: R? > R, 


ea ae 
a 


whilefor f: R? > R, 


wap 


The geometric significance of the gradient will be discussed in Section 2.6. In terms 
of inner products, we can write the derivative of f as 


Df (x)(h) = Vf (x) -h. 


Let f: R? > R, f(x, y, z) = xe’. Then 


əf af əf 
PL |) alay yey 
— (> ay’ =) (e , Xe 0); A 
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If f: R? > Risgiven by (x, y) + e” + sinxy, then 


Vf (x, y) = (ye + y cosxy)i + (xe” + x cosxy)j 
= (e¥ +cosxy)(yi+ xj). A 


In one-variable calculus itis shown that if f is differentiable, then f is continuous. 
We will state in Theorem 8 that this is also true for differentiable functions of several 
variables. As we know, there are plenty of functions of one variable that are continuous 
but not differentiable, such as f(x) = |x|. Before stating the result, let us give an 
example of a function of two variables whose partial derivatives exist at a point, but 
which is not continuous at that point. 


Let f: R? — R be defined by 


_ fi ifx =Oorify =0 
N= p otherwise. 


Because f is constant on the x and y axes, where it equals 1, 


af of 
ag 0)=0 and T 0) =0. 


But f is not continuous at (0, 0), because limit(x,y) — (0,0) f(x, y) does not exist. A 


Some Basic Theorems 
The first of these basic theorems relates differentiability and continuity. 


Theorem 8 Let f: U c R" — R™ be differentiable at x») c U. Then f is 
continuous at Xo. 


This result is very reasonable, because “differentiability” means that there is enough 
smoothness to have a tangent plane, which is stronger than just being continuous. Consult 
the Internet supplement for Chapter 2 for the formal proof. 

As we have seen, it is usually easy to tell when the partial derivatives of a function 
exist using what we know from one-variable calculus. H owever, the definition of differ- 
entiability looks somewhat complicated, and the required approximation condition in 
equation (4) may seem, and sometimes is, difficult to verify. Fortunately, there is a simple 
criterion, given in the following theorem, that tells us when a function is differentiable. 


Theorem 9 Let f:U c R" > R™. Suppose the partial derivatives afj /ax; 
of f all exist and are continuous in a neighborhood of a point x e U. Then f is 
differentiable at x. 
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We give the proof in the Internet supplement for Chapter 2. Notice the following 
hierarchy: 


Definition 
Theorem 9 of derivative 


Continuous partials > Differentiable = Partials exist 


Each converse statement, obtained by reversing an implication, is invalid. [For a coun- 
terexample to the converse of the first implication, use f (x) = x? sin(1/x), f(0) = 0; 
for the second, see Example 1 in the Internet supplement for Chapter 2 or use Example 4 
in this section.] 

A function whose partial derivatives exist and are continuous is said to be of class 
C1. Thus, Theorem 9 says that any C function is differentiable. 


Let 


cos x + e% 
f(x,y) = “ae 


Show that f is differentiable at all points (x, y) 4 (0, 0). 
Observe that the partial derivatives 


af (x? + y?)( ye*” — sinx) — 2x(cosx + e*’) 


ax (x2 + y2)2 
af (x? +.y?)xe*Y — 2y(cosx +e”) 
dy (x2 + y?)? 


are continuous except when x = 0 and y = 0 (by the results in Section 2.2). Thus, f is 
differentiable by Theorem 9. A 


In the Internet supplement we show that f (x, y) = xy/4/x? + y? [with f (0, 0) = 0] 
is continuous, has partial derivatives at (0, 0), yet is not differentiable there. See Fig- 
ure 2.3.4. By Theorem 9, its partial derivatives cannot be continuous at (0, 0). 


figure 2.3.4 This function is not differentiable 
at (0, 0), because it is “crinkled.” 
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1. Find of /ax, of /ay if 
(a) f(x,y) =xy 
(b) f(x, y) =e% 
(c) f(x, y) = x cosx cos y 
(d) f(x, y) = (x? + y?) log (x? + y?) 


2. Evaluate the partial derivatives 3z/əx, Əz/ðy for the 
given function at the indicated points. 


(a) z= \/a2 — x? — y?; (0, 0), (a/2, a/2) 
(b) z = log /T+xy;(1, 2), (0, 0) 
(c) z = @* cos(bx + y); (2z7/b, 0) 


3. In each case following, find the partial derivatives 
aw /Əx, aw /ay. 


(a) w = xex+y? 


x? +y? 
(b) w = eae 
(c) w = e% log (x? + y?) 
(d) w=x/y 


(e) w = cos(ye*Y) sinx 


4. Decide which of the following functions are C 1, which 
are just differentiable. 


2xy 
(x2 + y2)2 


(b) a 
y X 


(a) f(x,y) = 


(c) f(r,6) = 4r sin20,r > 0 


xy 
d) f(x, y) = ——— 
(d) f(x, y) Fray 
aiee 
= vay ye 


5. Find the equation of the plane tangent to the surface 
z =x? + y? at (3, 1, 10). 


6. Let f(x, y) = e¥+Y. Find the equation for the tangent 
plane to the graph of f at the point (0, 0). 


7. Let f(x, y) =e*—Y. Find the equation for the tangent 
plane to the graph of f at the point (1, 1). 


8. Using the respective functions in Exercise 1, compute 
the plane tangent to the graphs at the indicated points. 


(a) (0, 0) 
(b) (0, 1) 


10. 


11. 


12. 


13. 


14. 


15. 
16. 


17. 


18. 


(c) (0, x) 
(d) (0,1) 


» Compute the matrix of partial derivatives of the 


following functions: 


(a) f: R? > R?, f(x, y) = (x, y) 

(b) f: R? — R3, f(x, y) = (xeY + cosy, x, x +e) 
(c) f: R3 > R?, f(x, y, z) =(x + e + y, yx?) 

(d) f: R? > R3, f(x, y) = (xye%, xsin y, 5xy?) 


Compute the matrix of partial derivatives of 
(a) f(x, y) = (e%, sin xy) 

(b) f(x,y,z) =(x =y, y +2) 

(c) f(x,y) =(x+y,x-— y, xy) 

(d) f(x,y,z) = (x +z, y — 5z, x — y) 
Find the equation of the tangent plane to 


f(x, y) = x? — 2xy + 2y? having slope 2 in the positive 
x direction and slope 4 in the positive y direction. 


Let f(x, y) = e(2x+3y), 
(a) Find the tangent plane to f at (0, 0). 


(b) Use this to approximate f(.1, 0) and f(0, .1). 


(c) With a calculator, find the exact values of f (.1, 0) 
and f (0, .1). 


Where does the plane tangent to z = e*~Y at (1, 1, 1) 
meet the z axis? 


Why should the graphs of f(x, y) = x? + y? and 
g(x, y) = —x2 — y? + xy? be called “tangent” at (0, 0)? 


Let f(x, y) = e*Y. Show that x(of/ax) = y(of /ay). 
Use the linear approximation to approximate a suitable 
function f(x, y) and thereby estimate the following: 
(a) (0.99e0-02)8 


(b) (0.99)? + (2.01)3 — 6(0.99)(2.01) 
(c) «/(4.01)2 + (3.98)2 + (2.02)? 


Let P be the tangent plane to the graph of 

g(x, y) = 8 — 2x2 — 3y? at the point (1, 2, 6). Let 
f(x, y) = 4 — x? — y?. Find the point on the graph of f 
which has tangent plane parallel to P . 


Let f(x,y) = xe — yer, 


(a) Find the equation for the tangent plane to the graph 
of f at(1, 2). 
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(b) What point on the surface z = x? — y? has a 
tangent plane parallel to the plane found in part (a)? 


Compute the gradients of the following functions: 
(a) f(x, y, z) = x exp(—x? — y? — z2) (Note that 
expu = œ.) 
_ xyz 
(b) f(x,y,z) = a yF 


(c) f(X, y, z) = z?e* cosy 


Compute the tangent plane at (1, 0, 1) for each of the 
functions in Exercise 19. [The solution to part (c) only is 


23. 


24, 
25. 


(b) Show that f is not differentiable at (0,0) by showing 
that f is not continuous at (0,0). 


Let P be the tangent plane to f (x, y) = xy? at 

(1, 2, 8). Let! be the line contained in P which passes 
through the point (1, 3, 20) and passes directly above 
(2, 1). Thatis, | contains the point (1, 3, 20) and a point 
of the form (2, 1, z). Find a parametrization for |. 


Calculate Vh(1, 1, 1) if h(x, y, z) = (x + z)e*-Y. 


Let f(x, y, z) = x? + y? — 22, Calculate Vf (0, 0, 1). 


in the Study Guide.] 


26. Evaluate the gradient of f(x, y, z) =log(x2 + y? + z?) 
at (1, 0, 1). 


21. Find the equation of the tangent plane to z = x? + 2y3 


at (1, 1, 3). 27. Describe all Hdlder-continuous functions with œ > 1 
(see Exercise 33, Section 2.2). (HINT: What is the 
22. Let derivative of such a function?) 
24, 
eye zee f (x, y) # (0, 0) 28. Suppose f: R” > R™ is a linear map. What is the 
if (x, y) = (0, 0) derivative of f? 


(a) Show that ĉf (0, 0) and o, 0) exist. 


X 
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figure 2.4.1 The map cis the 
path; its image C is the curve we 
“see.” 


In this section, we introduce some of the basic geometry and computational methods 
for paths in the plane and space. This will be an important ingredient for the chain rule 
treated in the next section. We will return to paths with additional topics in Chapter 4. 


Paths and Curves 


We often think of a curve as a line drawn on paper, such as a straight line, a circle, 
or a sine curve. It is useful to think of a curve C mathematically as the set of values 
of a function that maps an interval of real numbers into the plane or space. We shall 
call such a map a path. We usually denote a path by c. The image C of the path then 
corresponds to the curve we see on paper (see Figure 2.4.1). Often we write t for the 


c(b) 
= curve = image of c 


> y 


example | 


example 2 


figure 2.4.3 e(f) = (cost, sin +) isa 


path whose image Cis the unit 


circle. 
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independent variable and imagine it to be time, so that e(t) is the position at time t of 
a moving particle, which traces out a curve as t varies. We also say c parametrizes C . 
Strictly speaking, we should distinguish between c(t) as a point in space and as a vector 
based at the origin. 


The straight line L in R? through the point (Xo, Yo, Zo) in the direction of vector v is the 
image of the path 


c(t) = (Xo, Yo, Zo) + tv 


fort € R (see Figure 2.4.2). Thus, our notion of curve includes straight lines as special 
cases. 


figure 2.4.2 L is the straight line in space 
through (xo, yo, Zo) and in direction v; its 
equation is c(t) = (X, yo. Zo) + tv. 


The unit circle C : x? + y? = 1 in the plane is the image of the path 
c:R—>R’, c(t) =(cost,sint), O<t<2z 


(see Figure 2.4.3). The unit circle is also the image of the path €(t) = (cos2t, sin 2t), 
0 <t < x. Thus, different paths may parametrize the same curve. A 


Paths and Curves A path in R” isa map c: [a,b] — R"; itis a path in the 
plane if n = 2 and a path in space if n = 3. The collection C of points c(t) ast 
varies in [a, b] is called a curve, and c(a) and ¢(b) are its endpoints. The path c 
is said to parametrize the curve C . We also say c(t) traces out C ast varies. 

If cisapath in R?, wecan writec(t) = (x(t), y(t), z(t)), and wecall x(t), y(t), 
and z(t) the component functions of c. We form component functions similarly 
in R? or, generally, in R". We also consider paths whose domain is the whole real 
line as in the next example. 


The path c(t) = (t, t?) traces out a parabolic arc. This curve coincides with the graph 
f (x) = x? (see Figure 2.4.4). 
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figure 2.4.4 The image of e(f) = (t, P) is the 
c(-1) = (-1, 1) (1, 1) = c(1) parabola y= x?. 


A wheel of radius R rolls to the right along a straight line at speed v. Use vector methods 
to find the path c(t) of the point on the wheel that initially lies at a distancer below the 
center. 


We place the wheel in the xy plane with its center initially at(0, R), so that the position 
of the center at time t is given by the path C(t) = (vt, R). (Refer to Figure 2.4.5.) 

The position of the point e(t) relative to the center is given by the vector d(t) = 
c(t) — C(t) that has the initial value —rj and rotates in the clockwise direction. The 
rate of rotation is such that the wheel makes a full rotation after the center has moved 
a distance 27R (equal to the circumference of the wheel). This takes a time 27R /v, so 
the angular velocity d@/dt of the wheel is v/R. Because the rotation is clockwise, the 
vector function d(t) is of the form 


d(t) =r (cos[-zt-+o]i+sin [-žt +eli) 


for some initial angle 6. Because d(0) = —rj, we have cos@ = 0 and sin @ = —1, so 
8 = —7/2, and hence 


d(t) =r (cos[-z — 5 |i+sin |-ġt — zli} 


figure 2.4.5 The vector d(f) points from the wheel's center, C(t), to the position e(t) of a point 
on the wheel and rotates in the clockwise direction while the wheel moves to the right. 


figure 2.4.7 An example of a 
hypocycloid. 
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c(t) =(t—sint,1—cost) 


> X 


2a 


figure 2.4.6 The curve traced by a point moving on the rim of 
a rolling circle is called a cycloid. 


Usingcos(g—2/2) = sin gandsin(g—z/2) = — cos o, along with cos(—@~) = cos o 
and sin(—g) = —sing, we get 
. ut, ut, 
d(t) =r (sng i — cos =i). 
Finally, the path e(t) is given by adding the components of the vector function d(t) to 
the coordinates of the path C(t); the result is 


, ut ut 
t) = (vt iR : 
c(t) € rsins cos $) 


In the special case v = R = r = 1, we get c(t) = (t — sint, 1 — cost). The image 
curve C of this path c is shown in Figure 2.4.6; itis called acycloid. A 


The preceding example considered the path of a point not necessarily on the rim of a 
wheel rolling along a straight line. W hen the wheel rolls on a circle, the resulting curve 
is called an epicycle. These are the epicycles discussed in the Ptolemaic theory in the 
introduction. If the wheel is outside the circle and the point is on the rim, the curve is 
called an epicycloid, and when the wheel is inside the circle, it is a hypocycloid. An 
example of the latter is shown in Figure 2.4.7. 


The French mathematician Blaise Pascal studied the cycloid in 1649 as a 

way of distracting himself at a time when he was suffering from a painful 
toothache. When the pain disappeared, he took it as a sign that God was 

not displeased with his thoughts. Pascal’s results stimulated other mathematicians 
to investigate this curve, and subsequently numerous remarkable properties 
were found. One of these was discovered by the Dutchman Christian Huygens, 
who used it in the construction of a “perfect” pendulum clock. 


Velocity and Tangents to Paths 


If we think of c(t) as the curve traced out by a particle and t as time, it is reasonable to 
define the velocity vector as follows. 


120 Differentiation 


figure 2.4.8 The vector e'(?) is 
tangent to the path e(f. 


Definition Velocity Vector If cis a path and itis differentiable, we say cisa 
differentiable path. The velocity of c at time t is defined by? 


c(t +h) = a) 


c(t) = lim h 


We normally draw the vector c’(t) with its tail at the point c(t). The speed of 
the path c(t) iss = ||¢’(t)||, the length of the velocity vector. If c(t) = (x(t), y(t)) 
in R?, then 

c(t) = (x(t), y(t) = x'(t)i + y(t)j 
and if c(t) = (x(t), y(t), z(t)) in R3, then 


c(t) = (x(t), y(t), z(t) = x"(t)i + y(t)j + z'(t)k. 


Here, x’(t) isthe one-variable derivative d x /dt. If we accept limits of vectors interpreted 
componentwise, the formulas for the velocity vector follow from the definition of the 
derivative. However, the limit can be interpreted in the sense of vectors as well. In 
Figure 2.4.8, we see that [c(t + h) — ¢(t)]/h approaches the tangent to the path as 
h—> 0. 


Tangent Vector The velocity c(t) is a vector tangent to the path c(t) at time 
t. If C isa curve traced out by c and if c’(t) is not equal to 0, then c(t) is a vector 
tangent to the curveC at the point c(t). 


31f t lies at the endpoint of an interval, we should, as in one-variable calculus, take right- or left-handed 
limits. 


example 5 


solution 


example 6 


solution 


example 7 


solution 
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If we think of the derivative De(t) as a matrix, it will be a column vector with the 
entries x(t), y’(t), and z’(t). Thus, the derivative here is consistent with our earlier 
notion. 


Compute the tangent vector to the path c(t) = (t, t?, et) att = 0. 


Here c'(t) = (1, 2t, et), and so att = 0 we obtain the tangent vector (1, 0, 1). A 


Describe the path c(t) = (cost, sint, t). Find the velocity vector at the point on the 
image curve wheret = x /2. 


For a given t, the point (cost, sint, 0) lies on the circle x? + y?=1 in the xy plane. 
Therefore, the point (cost, sint, t) lies t units above the point (cost, sint, 0) if t is 
positive and —t units below (cost, sint, 0) if t is negative. Ast increases, (cost, sint, t) 
wraps around the cylinder x? + y? = 1 with the z coordinate increasing. The curve this 
traces out is called a helix, which is depicted in Figure 2.4.9. Att = 2/2,¢(2/2) = 
(—sinz/2, cosx/2, 1) = (—1, 0, 1) = —i + k. 


figure 2.4.9 The helix e() = (cos t, sin t, f) wraps 


c(n/2)=-j+k 
ie) J around the cylinder xX? + y? =1. 


=y 
cost, SİN T, 1) 


(cos t, sin t 0) 
A 


The cycloidal path of a particle on the edge of a wheel of radius R with speed v is given 
by c(t) = (vt — R sin (vt/R), R — R cos(vt/R)). (See Example 4.) Find the velocity 
c'(t) of the particle as a function of t. When is the velocity zero? Is the velocity vector 
ever vertical? 


To find the velocity, we differentiate: 


vt , ut 
= € v cos 5, usin g | 

In vector notation, c'(t) = (v — v cos (vt /R ))i+ (v sin (vt/R))j. The component in the 
direction of iis v(1— cos (vt/R )), which is zero whenever vt/R is an integer multiple of 
2x. For such values of t, sin (vt/R ) is zero as well, so the only times at which the velocity 
is zero are when t = 2xnR /v for some integer n. At such times, e(t) = (27nR, 0), 
so the moving point is touching the ground. These moments occur at time intervals of 
2xR /v (more frequently for small wheels, as well as for rapidly rolling ones). 

The velocity vector is never vertical, because the horizontal component vanishes 
only when the vertical one does as well. A 
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example 8 


solution 


figure 2.4.10 Velocity vectors for the 
curve traced out by a point on the rim of 
a rolling wheel. 


Figure 2.4.10 shows some velocity vectors superimposed on the cycloidal path of 
Figure 2.4.6. 


Tangent Line 


The tangent line to a path at a point is the line through the point in the direction 
of the tangent vector. U sing the point-direction form of the equation of a line, we obtain 
the parametric equation for the tangent line. 


Tangent Line to a Path If c(t) isa path, and if c'(tọ) 4 0, the equation of its 
tangent line at the point c(t) is 


I(t) = e(to) + (t — to)e’(to). 


If C is the curve traced out by c, then the line traced out by I is the tangent line to 
the curve C at ¢(ty). 


Notice that we have written the equation in such a way that I goes through the point 
C(to) att = to (rather than t = 0). See Figure 2.4.11. 


A path in R? goes through the point (3, 6, 5) att = 0 with tangent vector i — j. Find the 
equation of the tangent line. 


The equation of the tangent line is 
I(t) = (3, 6, 5) + t(i—j) = (3, 6, 5) + (1, —1, 0) = (3 +t, 6 — t, 5). 
In (x, y, z) coordinates, the tangent lineisx =3+t,y=6-t,z=5. A 


Physically, we can interpret motion along the tangent line as the path that a particle 
on a curve would follow if it were set free at a certain moment. 


figure 2.4.11 The tangent line to a path. 
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example 9 | Suppose that a particle follows the path e(t) = (et, et, cost) until it flies off on a 
tangent att = 1. Whereis it att = 3? 


solution | Thevelocity vector is (et, —e, —sin t), which att = 1 is the vector (e, —1/e, —sin 1). 
The particle is at (e, 1/e, cos1) att = 1. The equation of the tangent line is I(t) = 
(e, 1/e, cos1) + (t — 1)(e, —1/e, —sin 1). Att = 3, the position on this line is 


(3) = (e > cos1) +2 (e -sin1) = (3e -i cos1 -2sin1) 


= (8.155, —0.368, —1.143). a 


exercises 


Sketch the curves that are the images of the paths in Exercises 1 to 4. 


1l. x = sint, y = 4cost, where 0 < t < 2x (b) Find a parametrization for C inducing a clockwise 
orientation and starting at (0, 2). 
2. x = 2sint, y = 4cost, where 0 < t < 2x (c) Find a parametrization for C if itis now centered at 


the point (4, 7). 


3. c(t) = (2t —1,t+2,t) 
6. Give a parametrization for each of the following curves: 
4. c(t) = (—t, 2t, 1/t), wherel <t <3 , l 
(a) The line passing through (1, 2, 3) and (—2, 0, 7) 
5. Consider the circle C of radius 2, centered at the origin. (b) The graph of f(x) = x? 
(a) Find a parametrization for C inducing a (c) The square with vertices (0, 0), (0, 1), (1, 1), and 
counterclockwise orientation and starting at (1, 0) (Break it up into line segments.) 
2 
(2, 0). (d) The ellipse given by + + % =1 


In Exercises 7 to 10, determine the velocity vector of the given path. 
7. c(t) = 6ti+ 3t?j + tk 9. r(t) = (cos?t, 3t — t3, t) 
8. c(t) = (sin 3t)i + (cos 3t)j + 2t3/2k 10. r(t) = (4e’, 6t4, cost) 


In Exercises 11 to 14, compute the tangent vectors to the given path. 


11. c(t) = (et, cost) 15. When is the velocity vector of a point on the rim of a 
rolling wheel horizontal? W hat is the speed at this 

12. c(t) = (3t2, t3) point? 

13. c(t) = (tsint, 4t) 16. If the position of a particle in space is (6t, 3t2, t3) at 


time t, what is its velocity vector att = 0? 
14. c(t) = (t?, e?) 


In Exercises 17 and 18, determine the equation of the tangent line to the given path at the specified value of t. 


17. (sin3t, cos3t, 2t5/2);t = 1 18. (cos?t, 3t —t3, t);t = 0 
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In Exercises 19 to 22, suppose that a particle following the given path c(t) flies off on a tangent at t = to. Compute the 
position of the particle at the given time tz. 


19, 
20. 
21. 
22. 
23. 


c(t) = (t2, t? — 4t, 0), where to = 2, t1 = 3 
c(t) = (et, et, cost), where to = 1, tı = 2 
c(t) = (4et, 6t4, cost), where tp = 0, tı = 1 
c(t) = (sinet, t, 4 — t3), where to = 1, tı = 2 


The position vector for a particle moving on a helix is 
c(t) = (cos(t), sin(t), t?). 


(a) Find the speed of the particle at time tọ = 47. 
(b) Is e’(t) ever orthogonal to e(t)? 
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(c) Find a parametrization for the tangent line to e(t) at 
to = 4r. 


(d) Where will this line intersect the xy plane? 


24. Consider the spiral given by e(t) = (et cos(t), et sin(t)). 
Show that the angle between c and c’ is constant. 


25. Letelt) = (t3, t2, 2t) and 
f(x, y, 2) = (x? — y?, 2xy, z2). 


(a) Find (f oc)(t). 


(b) Find a parametrization for the tangent line to the 
curve f ocatt =1. 


In elementary calculus, we learn how to differentiate sums, products, quotients, and 
composite functions. We now generalize these ideas to functions of several variables, 
paying particular attention to the differentiation of composite functions. The rule for 
differentiating composites, called the chain rule, takes on a more profound form for 
functions of several variables than for those of one variable. 

If f isareal-valued function of one variable, written asz = f(y), and y isafunction 
of x, written y = g(x), then z becomes a function of x through substitution, namely, 
z = f(g(x)), and we have the familiar chain rule: 


If f is a real-valued function of three variables u, v, and w, written in the form z = 
f(u, v, w), and the variables u, v, w are each functions of x, u = g(x), v = h(x), and 
w = k(x), then by substituting g(x), h(x), and k(x) for u, v, and w, we obtain z as a 
function of x: z = f(g(x), h(x), k(x)). The chain rule in this case reads: 


OF ou a əz dw 
dx dudx ðvdx ðw dxX` 


One of the goals of this section is to explain such formulas in detail. 


Sums, Products, Quotients 
These rules work just as they do in one-variable calculus. 
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Theorem 10 Sums, Products, Quotients 


(i) Constant Multiple Rule Let f: U c R" — R™ bedifferentiable at Xo and 
let c be areal number. Then h(x) = cf (x) is differentiable at xo and 


Dh(xo) = cDf (Xo) (equality of matrices). 


(ii) Sum Rule. Let f: U c R" > R” and g: U c R" > R" be differentiable 
at Xo. Then h(x) = f(x) + g(x) is differentiable at xo and 


Dh(xo) = Df (Xo) + Dg(Xo) (sum of matrices). 


(iii) Product Rule. Let f: U c R" > Rand g: U CR" — R be differentiable 
at Xo and let h(x) = g(x) f (x). Then h: U c R" — R is differentiable at Xo 
and 


Dh(xo) = g(Xo)Df (Xo) + f (Xo)Dg(xXo). 
(Note that each side of this equation isa 1 x n matrix; amore general product 
rule is presented in Exercise 31 at the end of this section.) 
(iv) Quotient Rule. With the same hypotheses asinrule (iii), leth(x) = f(x)/g(x) 


and suppose g is never zero on U . Then h is differentiable at xp and 


n g(xo)Df (Xo) — f (xo) Dg (xo) 


paon) TRAE 


proof The proofs of rules (i) through (iv) proceed almost exactly as in the one- 
variable case, with a slight difference in notation. We shall prove rules (i) and (ii), 
leaving the proofs of rules (iii) and (iv) as Exercise 27. 


(i) To show that Dh(Xo) = cDf (Xo), we must show that 


h(x) — h(Xo) — cDf (Xo)(x — Xo) || 


limit =0, 
X>X0 IIx — xoll 
that is, that 
limit IIc f (x) — cf (Xo) — cDf (xo) (x — Xo) I| 0, 
X> Xo IIx — xoll 


[see equation (4) of Section 2.3]. This is certainly true, since f is differentiable and 
the constant c can be factored out [see Theorem 3(i), Section 2.2]. 
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solution 


(ii) By the triangle inequality, we may write 


h(x) — h(Xo) — [Df (xo) + Dg (Xo) 1(x — Xo) | 


|X — xoll 
=i f(x) — f (Xo) — [Df (Xo) ](x — Xo) + g(x) — g (Xo) — [Dg (Xo) ](x — Xo) || 
|X — Xoll 
P If (x) — f (Xo) — [Df (Xo)](x — Xo) I| i g(x) — g(Xo) — [Dg(Xo) ](x — Xo) || 
E IIx — Xoll IIx — Xoll ' 


and each term approaches 0 as x —> Xo. Hence, rule (ii) holds. i 


Verify the formula for Dh in rule (iv) of Theorem 10 with 
f(x, y,z) =X? + y? +z? andg(x, y,z) =x? +1. 


Here 


xX ye 2 


ee acre 


so that by direct differentiation 


_ foh ah ah] [(x?+1)2x —(x?+y?427)2x 2y 22 
Dine a Se ay |=| (x? + 1)? 'x2?+1'x?+1 
o f2x(l—y?—2z?) 2y 2z 
7 (x2+1)2 "x24 1' x241]° 


By rule (iv), we get 


= gDf — fDg _ (x? + 1)[2x, 2y, 2z] — (x? + y? +27)[2x, 0, 0] 


aa g? (x? +1)? 


which is the same as what we obtained directly. A 


Chain Rule 


A s we mentioned earlier, it is in the differentiation of composite functions that we meet 
apparently substantial alterations of the formula from one-variable calculus. H owever, if 
we use the D notation, that is, matrix notation for derivatives, the chain rule for functions 
of several variables looks similar to the one-variable rule. 


Theorem 11 Chain Rule LetU c R"andV c R" beopensets.Letg: U c 
R” > R” and f: V c R™ > R” begiven functions such that g maps U into V, 
so that f og is defined. Suppose g is differentiable at Xo and f is differentiable at 
Yo = 9(Xo). Then f 


D(f o g)(xo) = Df (Yo)Dg (xo). (1) 


The right-hand side is the matrix product of D f (yo) with Dg (Xo). 
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We shall now give a proof of the chain rule under the additional assumption that the 
partial derivatives of f are continuous, building up to the general case by developing 
two special cases that are themselves important. (The complete proof of Theorem 11 
without the additional assumption of continuity is given in the Internet supplement for 
Chapter 2.) 


First Soecial Case of the Chain Rule 


Suppose c: R —> R? is a differentiable path and f: R? > R. Let h(t) = f(e(t)) = 
f (x(t), y(t), z(t)), where c(t) = (x(t), y(t), z(t)). Then 


dh ofdx afdy əf dz 
dt axdt | ay dt © az dt’ 


That is, 
dh yfi, 
dt 

(where e!(t) = (x’(t), y(t), z/(t)). 


Thisisthe special case of Theorem 11 in which wetakec = g and f to be real-valued, 
and m = 3. Notice that 


where the product on the left-hand side is the dot product of vectors, while the product 
on the right-hand side is matrix multiplication, and where we regard Df (c(t)) as a row 
matrix and De(t) as a column matrix. The vectors Vf(¢(t)) and c(t) have the same 
components as their matrix equivalents; the notational change indicates the switch from 
matrices to vectors. 


proof of equation (2) By definition, 


dh o. 
get = limit 


Adding and subtracting two terms, we write 


h(t) —h(to) — F(x(t), y(t), 2(t)) — f(x(to), y(to), z(to)) 


f=t t— to 


f(x(to), y(t), z(t)) — f(x(to), y(to), z(t)) 


f(x(to), y(to), z(t)) — f (x(to), y(to), z(to)) 
+ 
t — to 
Now we invoke the mean-value theorem from one-variable calculus, which states: If 
g: [a, b] —> Ris continuous and is differentiable on the open interval (a, b), then there 
is a point c in (a, b) such that g(b) — g(a) = g’(c)(b — a). Applying this to f asa 
function of x, we can assert that for some c between x and Xo, 


f(x, y, z) — f (Xo, y, Z) = Ec y, 2)|( = rhe 
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In this way, we find that 


h(t)—h(to) faf x(t) — x(to) Taf aiad 
t= Ee yt) 20) k [Z iton a zep E 
af Yal 
T [Zit ys) oe 


where c, d, and e lie between x(t) and x(to), between y(t) and y(to), and between z(t) 
and z(to), respectively. Taking the limit t — to, using the continuity of the partials 
af /ax, af /ay, af /az, and the fact that c, d, and e converge to x(to), y(to), and z(t), 
respectively, we obtain formula (2). m 


Second Special Case of the Chain Rule 
Let f: R? —> Rand let g: R? > R?. Write 
g(x, y, Z) = (u(x, y, Z), v(x, y, Z), W(X, y, Z)) 
and define h: R? — R by setting 
h(x, y, z) = f (u(x, y, Z), v(x, y, 2), W(X, y, 2)). 


In this case, the chain rule states that 


ðu ðu du 

əx ay az 
E ah | E af aa ðv ðv ðv (3) 
ax oy az ðu ðv ow əx əy az 

ow ðw aw 

əx əy az 


In this special case, we have taken n = m = 3 and p = 1 for concreteness, 
and U = R? and V = R? for simplicity, and have written out the matrix product 
[Df (yo) ][Dg(Xo)] explicitly (with the arguments Xo and yo Suppressed in the matrices). 


proof of the second special case of the chain rule By definition, ah/ax 
is obtained by differentiating h with respect to x, holding y and z fixed. But then 
(u(x, y, Z), v(x, Y, Z), W(X, y, Z)) may be regarded as a vector function of the single 
variable x. The first special case applies to this situation and, after the variables are 
renamed, gives 


ah af au af dv af aw 


Bx dudX | Jvəx | OW OX” (3) 


Similarly, 


ah _ of əu af du af aw 


ay du əy dvdy ow dy (3") 


and 


ah _ of au | af av | af aw 
əz gudz = dvdz aw ðz` 


(3”) 
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In actual calculations, at the last step, we usually express %, 2, and 2 in terms of 
X, y, z. These equations are exactly what would be obtained by multiplying out the 


matrices in equation (3). m 


proof of theorem 11 The general casein equation (1) may be proved in two steps. 
First, equation (2) is generalized to m variables; that is, for f (X1, ..., Xm) and c(t) = 
(Xa(t), ..., Xm(t)), we have 


dh 1 af dXi 
at a at 


where h(t) = f(xı(t), ..., Xm(t)). Second, the result obtained in the first step is used 
to obtain the formula 


m 


ah, = » af; OY 


ax 4—4 ayk OX; | 
where f = (fı, ..., fp) is a vector function of arguments yi, ..., Ymi9(X1,---, Xn) = 
Wry <., Xn), -1 Von (Xty coon Xn); and NG (Kay ase Xn) = fil Ya, cone Xn) eves 
Ym(X1, -.-,Xn)). (Using the letter y for both functions and arguments is an abuse of 


notation, butitcan help us remember the formula.) This formulais equivalent to formula 
(1) after the matrices are multiplied out. E 


The pattern of the chain rule will become clear once you have worked some additional 
examples. For instance, 


_ df au, af av | of aw | af az 
~ gu ax  dvax OW ax az ax’ 


~ (u(x, y), v(x, y), W(X, y), 20%, y)) 


with a similar formula for af /ay. 

The chain rule can help us understand the relationship between the geometry of 
a mapping f: R? — R? and the geometry of curves in IR’. (Similar statements may 
be made about IR? or, generally, IR".) If ¢(t) is a path in the plane, then as we saw in 
Section 2.4, c(t) represents the tangent (or velocity) vector of the path e(t), and this 
tangent (or velocity) vector is thought of as beginning at c(t). Now let p(t) = f (c(t)), 
where f : R? —> R?. The path p represents the image of the path c(t) under the mapping 
f . The tangent vector to p is given by the chain rule: 


matrix 
multiplication 


P'O =Df(c(t)) c(t). 
Ld 


matrix column 
vector 


In other words, the derivative matrix of f maps the tangent (or velocity) vector of 
a path c to the tangent (or velocity) vector of the corresponding image path p (see 
Figure 2.5.1). Thus, points are mapped by f, while tangent vectors to curves are 
mapped by the derivative of f, evaluated at the base point of the tangent vector in 
the domain. 
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figure 2.5.1 Tangent vectors are 
mapped by the derivative 
matrix. 


example 2 


solution 


example 3 


solution 


p'(t) = Dfle(t))e(t) 


p(t) = f(c(t)) 


p(t) is the image of e(t) under f 


Verify the chain rule in the form of formula (3’) for 
f(u, v, w) =u? +v? —w, 
where 
u(x, y,z)=x’y, v(x, y, z) =y? w(x, y, z) = e. 
Here 
h(x, y, z) = f (u(x, y, Z), v(x, y, z), W(X, y, Z)) 
= (x?y)? + yt — eo = xy? + yt e. 

Thus, differentiating directly, 


ah 
— = 4x3y* + ze-*?, 
ox i 


On the other hand, using the chain rule, 


əh oaf dus Of dua aw E 
əx du ax Bs dv OX T aw OX = 2u(2xy) + 2v-0 + (—1)(—ze-*’) 


= (2x*y)(2xy) + ze, 


which is the same as the preceding equation. A 


Given g(x, y) = (x? + 1, y?) and f (u, v) = (u + v, u, v2), compute the derivative of 
f og at the point (x, y) = (1, 1) using the chain rule. 


The matrices of partial derivatives are 


ah ah 
i 14 2x 0 
Df(u v=] %]|=]1 0 and Dg(x,y)= | | 
ðu ðv 0 2v 0 2y 
df; dfs 


ðu ðv 
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When (x, y) = (1, 1), note that g(x, y) = (u, v) = (2, 1). Hence, 
1 1 2 0 2 2 
D(f og)(1,1) = Df (2, 1)Dg(1,1)= |1 0 F J- 2 0 


is the required derivative. A 


example 4 | Let f(x,y) be given and make the substitution x = rcosé,y = r sino (polar 
coordinates). W rite a formula for af /38. 


solution By the chain rule, 


əf of ax af ay 
00 9x00 ay ae’ 


that is, 


of . of of 
— =-rsind— +r cosd—. 
əy 


00 ax A 


example 5 | Let f(x, y) = (cosy + x2, eX+Y) and g(u, v) = (e”, u — sin v). (a) Write a formula 
for f og. (b) Calculate D( f og)(0, 0) using the chain rule. 


solution (a) Wehave 


(f og)(u,v) = fle”, u — sin v) 
= (cos (u — sin v) +e, ee sacl 


(b) By the chain rule, 
D(f og)(0, 0) = [Df (g(0, 0))][Dg(0, 0)] = [Df (1, 0)][Dg(0, 0)]. 
Now 


_ f2ue” 0 fo 0 
D9(0,0) = | 1 E yan 7 E | 


and 


_ | 2x —siny _ {2 0 
Dili |e. aie _— =| 


[Remember that Df is evaluated at g(0, 0), not at (0, 0)!]. Thus, 
2 
nirogao=|2 oft ak Sl a 


example 6 | Let f:U c R" —> R” be differentiable, with f = (fı, ..., fm), and let g(x) = 
sin [ f (x) - f (x)]. Compute Dg(x). 
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solution 
f2(x) +---+ f2(x). Then 
Dh(x) = = 
0X1 
= lan 
OX, 
f=[f; 
exercises 


By the chain rule, Dg(x) = cos[ f(x) - f(x)]Dh(x), where h(x) = [f(x)- f(x)] = 


which can be written 2 f (x)Df (x), where we regard f as a row matrix, 


fm] 


of, fy 
and Df = : 
afm afm 


Thus, Dg(x) = 2[cos( f (x)- f(x) ]fœ)Df (x). A 


1. if f: U c R" — Ris differentiable, prove that 


X f?(x) +2f(x) is differentiable as well, and 
compute its derivative in terms of Df (x). 


. Prove that the following functions are differentiable, and 


find their derivatives at an arbitrary point: 


(a) f: R? >R, (x,y) 2 

(b) f: R? > R, (x,y) x+y 

(c) f: R? —> R, (x,y) 2+x+y 

(d) f: R? > R, (x, y) > x? + y? 

(e) f: R? —> R, (x, y) = e 

(f) f: U >R, (x, y) œ> y/1— x? — y2, where 
U ={(x, y) |X? +y? <1} 

(g9) f: R? > R, (x, y) xt- y4 


3. Verify the first special case of the chain rule for the 


composition f o cin each of the cases: 

(a) f(x, y) = xy, e(t) = (et, cost) 

(b) f(x, y) = eY, e(t) = (3t2, t3) 

(c) f(x,y) = (x? + y?) log \/x? + y?, c(t) = 
(e*, e™*) 

(d) f(x, y) = x exp(x? + y?), e(t) = (t, —t) 


. What is the velocity vector for each path e(t) in 


Exercise 3? [The solution to part (b) only is in the Study 
Guide to this text.] 


10. 


. Let f: R? > R and g: R? > R be differentiable. Prove 


that 


V( fg) = f Yg +9Vf. 


. Let f: R? — R be differentiable. M aking the 


substitution 


x = pcosO sing, y=psindsing, Z=pcosd 


(spherical coordinates) into f (x, y, z), compute 
df /dp, Əf/30, and of /ð¢ in terms of 
af /ax, af /ay, and af /az. 


. Let f (u, v) = (tan (u — 1) — e”, u? — v?) and 


g(x, y) = (e*7Y, x — y). Calculate f og and 
D(f og)(1, 1). 


» Let f(u, v, w) = (eX, cos(v + u) + 


sin(u + v +w)) and g(x, y) = (e*, cos( y — x), e7”). 
Calculate f o g and D(f o g)(0, 0). 


» Find (d/ds)( f oT )(1, 0), where f(u, v) = cos u sin v 


and T : R? — R? is defined by 
T (s, t) = (cos (t?s), log VI+ s2). 


Suppose that the temperature at the point (x, y, z) in 
space is T (x, y, z) = X? + y? + z2. Leta particle follow 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


the right-circular helix ø(t) = (cost, sint, t) and let 
T (t) be its temperature at time t. 


(a) WhatisT’(t)? 


(b) Find an approximate value for the temperature at 


Let f(x, y, Z) = (3y + 2, X? + y2, x +22). Let 
c(t) = (cos(t), sin(t), t). 


(a) Find the path p = f o c and the velocity vector 
p’(z). 
(b) Find ¢(z), e’(x) and D f(—1, 0, x). 


(c) Thinking of D f (—1, 0, x) as a linear map, find 
D f (—1, 0, x) (c (7x)). 


Leth: R3 —> R5 and g: R2? —> R? be given by 
h(x, y, z) = (xyz, e%, x sin(y), 32, 17) and 
glu, v) = (v? + 2u, x, 2/0). Find D(h o g)(1, 1). 


Suppose that a duck is swimming in the circle 

x = cost, y = sint and that the water temperature is 
given by the formula T = x2eY — xy3. Find dT /dt, the 
rate of change in temperature the duck might feel: (a) by 
the chain rule; (b) by expressing T in terms of t and 
differentiating. 


Let f: R" —> R™ be a linear mapping so that (by 
Exercise 28, Section 2.3) Df (x) is the matrix of f. 
Check the validity of the chain rule directly for linear 
mappings. 


Let f: R? > R2; (x, y) > (e¥+Y, eX-Y). Lete(t) bea 
path with c(0) = (0, 0) and c'(0) = (1, 1). What is the 
tangent vector to the image of c(t) under f att = 0? 


Let f(x, y) = 1/4/x2 + y2. Compute Vf (x, y). 


Write out the chain rule for each of the following 
functions and justify your answer in each case using 
Theorem 11. 


(a) əh/əx, where h(x, y) = f(x, u(x, y)) 
(b) dh/dx, where h(x) = f(x, u(x), v(x)) 
(c) ah/ax, where h(x, y,z) = f(u(x, y,2), 


v(x, y), W(x) 
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18. Verify the chain rule for əh /Əx, where 
h(x, y) = f (u(x, y), v(x, y)) and 


u2 +v? -x— 
fluo) = —z, Uy) =e, olx y) =e. 
19. (a) Let y(x) be defined implicitly by G (x, y(x)) = 0, 

where G is agiven function of two variables. Prove 


that if y(x) and G are differentiable, then 


dy aG /ax i aG 
= if 
dx aG /ay ay 


(b) Obtain a formula analogous to that in part (a) if 
yı, y2 are defined implicitly by 


Gilx, yi(X), y2(x)) = 0, 
G 2(x, yi(x), y2(x)) = 0. 


(c) Let y be defined implicitly by 
x74 y3 +e =0. 
Compute dy/dx in terms of x and y. 
20. Thermodynamics texts* use the relationship 
aaa) 
ax/ \ay/ \az/ 
Explain the meaning of this equation and prove that it is 
true. [HINT: Start with a relationship F (x, y, z) = 0 that 


defines x = f(y,z), y = g(x, z), and z = h(x, y) and 
differentiate implicitly.] 


21. Dieterici’s equation of state for a gas is 
P(V —b)ea/RVT = RT, 


where a, b, and R are constants. Regard volume V as a 
function of temperature T and pressure P and prove that 


ar (8+57)/(g-a- 2) 
aT TV V—b Ve)" 
22. This exercise gives another example of the fact that the 


chain rule is not applicable if f is not differentiable. 
Consider the function 


xy? 
iit Ce 
0 (x, y) = (0, 0). 


4See S. M. Binder, “Mathematical M ethods in Elementary Thermodynamics,” J. Chem. Educ., 43 (1966): 85-92. A proper understanding of 
partial differentiation can be of significant use in applications; for example, see M . Feinberg, “Constitutive Equation for Ideal Gas M ixtures and 
Ideal Solutions as Consequences of Simple Postulates,” Chem. Eng. Sci., 32 (1977): 75-78. 
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23. 


24, 


25. 


26. 


27. 


28. 


Differentiation 


Show that 
(a) əf/əx and of /ay exist at (0, 0). 


(b) If g(t) = (at, bt) for constants a and b, then f og 
is differentiable and ( f o g)'(0) = 
ab2/(a2 + b?), but vf (0, 0) - g'(0) = 0. 


Prove that if f: U c R" — R is differentiable at 

Xo € U , there is a neighborhood V of Oc R" anda 
function R1: V — R such that for all h € V, we have 
xo+heU, 


f (Xo +h) = f (xo) + [Df (xo)]h + Ri(h) 


and 
Ri(h) 
{| hj 


>0 a h—O. 


Suppose Xp € R” and 0 < rı < r2. Show that there is a 
Ct function f: R” —> R such that f(x) = 0 for 

|X — Xoll = 2; 0 < f(x) <1 forry < |x — Xoll < r2; 
and f(x) = 1 for ||x — Xoll < rı. [HINT: Apply a cubic 
polynomial with g(r?) = 1 and g(r3) = g'(r2) = 
g'(r2) = 0 to |x — xoll? when rı < IIx — xoll < r2.] 


Find aC! mapping f: R? —> R? that takes the vector 
i +j +k emanating from the origin to i — j emanating 
from (1, 1, 0) and takes k emanating from (1, 1, 0) to 

k — i emanating from the origin. 


W hat is wrong with the following argument? Suppose 
w = f(x, y,z) and z = g(x, y). By the chain rule, 


aw Oz ow ow Oz 


3z ax ax | Oz ax" 


aw ay 
ay ax 


aw aw ax 
əx ax Ox 


Hence, 0 = (dw /dz)(dz/dx), and so dw /dz = 0 or 
02/0x = 0, which is, in general, absurd. 


Prove rules (iii) and (iv) of Theorem 10. (HINT: Use the 
same addition and subtraction tricks as in the 
one-variable case and Theorem 8.) 


Show thath: R" —> R" is differentiable if and only if 


each of the m components h;: IR" — R is differentiable. 


(HiNT: Use the coordinate projection function and the 
chain rule for one implication and consider 


Ee — h(xo) — Dh(xo)(x — Xo) [1]? 
IX — xoll 

E Soy yl hi (x) — hi (xo)Dhi (xo)(x — xo)]? 

7 Ix — xoll? 


to obtain the other.) 


29, 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


Usethe chain rule and differentiation under the integral 
sign, namely, 


d 7° baf 
xf Fos yay =f ay (XY) dy 


to show that 


d f * af 
af ienas tout | yay. 


For what integers p > 0 is 


Dei 
f(x) = 7 sin (1/x) me 


differentiable? For what p is the derivative continuous? 


Suppose f: R" —> R and g: R" —> R™ are 
differentiable. Show that the product function 

h(x) = f (x)g(x) from R” to R™ is differentiable and 
that if xo and y arein R", then [Dh(xo)]y = 

f (xo) {[D.g (Xo) ly} + {LDf (xo) ly} 9 (Xo). 


Let g(u, v) = (e4, u + sin v) and f(x, y, z) = (xy, yz). 
Compute D(g o f) at (0, 1, 0) using the chain rule. 


Let f: Rf —> R and c(t): R > R4. Suppose 
Vf(1, 1, x, e) = (0, 1,3, —7), ex) = (1, 1, x, e), 


and c'(x) = (19, 11, 0, 1). Find 2! ka 


Suppose f: R" —> R™ and g: RP > R'. 


whent = x. 


(a) What must be true about the numbers n, m, p, and 
q for f o g to make sense? 


(b) What must be true about the numbers n, m, p, and 
q for g o f to make sense? 


(c) When does f o f make sense? 


If z = f (x — y), use the chain rule to show that 
əz a _ 4 


— + — = 
ðX əy 
Letw =X? +y? +z?, x =uv, y =u cosu, 


z = u sin v. Use the chain rule to find z 
when(u, v) = (1, 0). 
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2.6 Gradients and Directional Derivatives 


example | 


example 2 


In Section 2.1 we studied the graphs of real-valued functions. Now we take up this 
study again, using the methods of calculus. Specifically, gradients will be used to obtain 
a formula for the plane tangent to a level surface. 


Gradients in R° 


Let us recall the definition. 


Definition The Gradient If f: U c R? > Ris differentiable, the gradient 
of f at(x, y, z) is the vector in space given by 


i= 
ax’ ay’ az 
Thisvectoris also denoted Vf (x, y, z). Thus, Vf isjustthe matrix of the derivative 
Df, written as a vector. 


Let f(x, y, Z) = \/x*+ y? + z? = r, the distance from 0 to (x, y, z). Then 


of of of 
Vf (x, y, Z) = ax’ ay’ az 


E x y Z r 
7 Caeser VEHY +z? ol r' 


where r is the point (x, y, z). Thus, Vf is the unit vector in the direction of (x, y, 2). 
A 


If f(x, y, zZ) = xy +z, then 


of af əf 
BeF ae eed as = 1 rd); 
ax’ ay z) CNAA 


Vf(xX, y,z) = ( A 


Suppose f: R? — R is a real-valued function. Let v and x € R? be fixed vectors 
and consider the function from R to R defined by t +> f(x + tv). The set of points 
of the form x + tv,t € R, is the line L through the point x parallel to the vector v 
(see Figure 2.6.1). 


figure 2.6.1 The equation of L is (} = X + tv. 
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Directional Derivatives 


The function t +> f(x + tv) represents the function f restricted to the line L. For 
example, if a bird flies along this line with velocity v so that x + tv is its position at 
time t, and if f represents the temperature as a function of position, then f(x + tv) is 
the temperature at time t. We may ask: How fast are the values of f changing along the 
line L at the point x? Because the rate of change of a function is given by a derivative, 
we could say that the answer to this question is the value of the derivative of this function 
of t att = 0 (when t = 0, x + tv reduces to x). This would be the derivative of f at the 
point x in the direction of L; thatis, of v. We can formalize this concept as follows. 


Definition Directional Derivatives If f: R? > R, the directional deriva- 
tive of f atx along the vector vis given by 


d 
ae Xt) P 


if this exists. 

In the definition of a directional derivative, we normally choose v to be a unit 
vector. In this case, we are moving in the direction v with unit speed and we refer 
to $ f (x + tv)|,_ọ as the directional derivative of f in the direction v. 


We now elaborate on why a unit vector is chosen in the definition of the directional 
derivative. Suppose that f measures the temperature in degrees and that we are interested 
in how fast the temperature changes as we move in a particular direction. If we are 
measuring distance in meters, then the rate of change of temperature will be measured 
in degrees per meter. Suppose, for simplicity, that the temperature is changing at a 
constant rate— say, two degrees per meter— as we move in a given direction v starting at 
x. Thus, when we go one meter ahead, the temperature changes by two degrees. T hat is, 


f(x+v) — f(x) =2. 


Such a relation is going to hold only when vis a unit vector, reflecting the fact that we 
are going ahead by one meter. M ore generally, the definition of the directional derivative 
is going to truly measure only the rate of change of f with respect to distance along a 
line in a given direction if v is a unit vector. 

From the definition, we can see that the directional derivative can also be defined by 
the formula 


Theorem 12 If f: R? — R is differentiable, then all directional derivatives 
exist. The directional derivative at x in the direction v is given by 


Df (x)v = grad f (x) -v = Vf (x) -v = Zo] vı + HJ v + BJ V3, 


where v = (v1, v2, v3). 


example 3 


solution 


example 4 


solution 
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proof Letelt) = x+ tv, so that f(x+tv) = f(c¢(t)). By the first special case of 
the chain rule, (d/dt) f(¢(t)) = Vf(¢(t)) -c(t). However, c(0) = x and c(0) = v, 
and so 


d 
ae | x + bY) 


as we were required to prove. E 


t=0 


Notice that one does not have to use straight lines when computing the rate of change 
of f inaspecific direction v. Indeed, for a general path e(t) with c(0) = xand c'(0) = v, 
we have from the chain rule, 


d 


ag | (ett) 


t=0 


Let f(x, y, z) = xe”. Compute the rate of change of f in the direction of the unit 
vector 


1 1 1 
v= | —, = = at the point 1, 0, 0). 

(ERR) a ee 
The required rate of change is, using Theorem 12, 


Vf -v = (2xe-%, —x2ze-%, —x2ye7¥) . ( 


which, at the point (1, 0, 0), becomes 


1 1 1 2 
aN 0)- (z We a ar á 


In the last example, find the rate of change of f in the direction of the vector w = 
(1,1,1). 


w is not a unit vector. Replacing w by 


= TT = (7 T A) 


and proceeding as in Example 3, we again obtain 2/,/3 as our answer. A 


Directions of Fastest Increase 


From Theorem 12 we can also obtain the geometric significance of the gradient: 


Theorem 13 Assume Vf(x) 4 0. Then Vf (x) points in the direction along 
which f is increasing the fastest. 


proof |fmisaunitvector, therateof change of f indirection nis given by Vf (x) - n = 
\| Vf (x) || cos@, where @ is the angle between n and Vf (x). This is maximum when 
6 = 0; that is, when n and Vf are parallel. [If Vf (x) = O this rate of change is 0 for 
any n] m 
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In other words, if we wish to move in a direction in which f will increase most 
quickly, we should proceed in the direction Vf (x). Analogously, if we wish to moveina 
direction in which f decreases the fastest, we should proceed in the direction —Vf (x). 
example 5 | In what direction from (0, 1) does f(x, y) = x? — y? increase the fastest? 
solution | Thegradientis 
Vf = 2xi- 2yj, 
and so at (0, 1) thisis 
Vf kon = —2j. 


By Theorem 13, f increases fastest in the direction —j. (Can you see why this answer 
is consistent with Figure 2.1.9?) A 


Gradients and Tangent Planes to Level Sets 


Now we find the relationship between the gradient of a function f and its level surfaces. 
The gradient points in the direction in which the values of f change most rapidly, 
whereas a level surface lies in the directions in which they do not change at all. If f is 
reasonably well behaved, the gradient and the level surface will be perpendicular. 


Theorem 14 The Gradient is Normal to Level Surfaces Let f: R? > R 
beaC!mapandlet(xo, Yo, Zo) lieon the level surface S defined by f(x, y, z) =k, 
for k a constant. Then Vf (Xo, Yo, Zo) is normal to the level surface in the following 
sense: If vis the tangent vector att = 0 of a path c(t) in S with c(0) = (Xo, Yo, Zo), 
then Vf (Xo, Yo, Zo) -v = 0 (see Figure 2.6.2). 


proof Letc(t) lieinS; then f(¢(t)) = k. Letvbeasin the hypothesis; then v = c’(0). 
Hence, the fact that f(¢(t)) is constant in t, and the chain rule give 


V(X, Yo. Zo) 
parallel translated so 
that it begins 


at (Xo Yo, Zo) 


figure 2.6.2 Geometric 
significance of the gradient: V f is 
orthogonal to the surface S on 
which f is constant. 


example 6 


solution 
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If we study the conclusion of Theorem 14, we see that it is reasonable to define the 
plane tangent to S as the orthogonal plane to the gradient. 


Definition Tangent Planes to Level Surfaces Let S be the surface 
consisting of those (x, y, z) suchthat f(x, y, z) =k, fork a constant. The tangent 
plane of S ata point (Xo, Yo, Zo) of S is defined by the equation 


Vf (Xo, Yo. Zo) -(X — Xo, Y — Yo, Z — Zo) = 0 (1) 


if Vf (Xo, Yo, Zo) Æ 0. That is, the tangent plane is the set of points (x, y, z) that 
satisfy equation (1). 


This extends the definition we gave earlier for the tangent plane of the graph of a 
function (see Exercise 15 at the end of this section). 


Compute the equation of the plane tangent to the surface defined by 3xy + z? = 4 at 
(1,1, 1). 


Here f(x, y,z) = 3xy +z? and Vf = (3y, 3x, 2z), which at (1, 1, 1) is the vector 
(3, 3, 2). Thus, the tangent plane is 


(3,3, 2)-(x —1,y —1,z-1) =0; 
that is, 
3x + 3y + 2z = 8. A 


In Theorem 14 and the definition following it, we could just as well have worked in 
two dimensions as in three. Thus, if we have f : R? — R and consider a level curve 


C ={(x,y) | f(x,y) =k}, 


then Vf (Xo, Yo) is perpendicular to C for any point (Xo, yo) on C . Likewise, the tangent 
line to C at (Xo, Yo) has the equation 


Vf (Xo, Yo) +(X — Xo, Y — Yo) = 0 (2) 


if Vf (Xo, Yo) 4 0; that is, the tangent line is the set of points (x, y) that satisfy equation 
(2) (see Figure 2.6.3). 


y 


Vf translated 


figure 2.6.3 In the plane, the gradient V f 


is orthogonal to the curve f = constant. 
Tangent line to C 
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figure 2.6.4 The gradient Vf of a function 
f: R? > Ris a vector field on R°; at each point 
P;, VF(P;) is a vector emanating from P;. 


The Gradient Vector Field 


We often speak of Vf as a gradient vector field. The word “field” means that Vf assigns 
a vector to each point in the domain of f. In Figure 2.6.4 we describe the gradient Vf 
not by drawing its graph, which, if f: R? —> R, would bea subset of IR°— that is, the set 
of tuples (x, Vf (x)), but by representing Vf (P), for each point P, as a vector emanating 
from the point P rather than from the origin. Like a graph, this pictorial method of 
depicting Vf contains the point P and the value Vf (P) in the same picture. 

The gradient vector field has important geometric significance. It shows the direction 
in which f is increasing the fastest and the direction that is orthogonal to the level sur- 
faces (or curves in the plane) of f. That it does both of these at once is quite plausible. 
To see this, imagine a hill as shown in Figure 2.6.5(a). Let h be the height function, 
a function of two variables. If we draw level curves of h, these are just level contours 
of the hill. We could imagine them as level paths on the hill [see Figure 2.6.5(b)]. One 
thing should be obvious to anyone who has gone for a hike: To get to the top of the 
hill the fastest, we should walk perpendicular to level contours.° This is consistent with 
Theorems 13 and 14, which state that the direction of fastest increase (the gradient) is 
orthogonal to the level curves. 


A curve of 
steepest ascent 
up the hill 


Countour 
map of a hill 
250 feet high 


(a) (b) 
figure 2.6.5 A physical illustration of the two facts (a) Vf is the direction of fastest increase of f, 


and (b) Vf is orthogonal to the level curves. 


>This discussion assumes that one walks at the same speed in all directions. Of course, hikers know 
that this is not necessarily realistic. 


example 7 


example 8 


solution 


example 9 
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The gravitational force on a unit mass m at (x, y, z) produced by amass M atthe origin 
in IR? is, according to N ewton’s law of gravitation, given by 


_GmM 


F=- 


where G is a constant; r = |r|| = \/x2+ y? + z?, which is the distance of (x, y, z) 
from the origin; and n = r/r, the unit vector in the direction of r = xi + yj +zk, which 
is the position vector from the origin to (x, y, Z). 

Note that F = V(GmM /r) = —W ; that is, F is the negative of the gradient of the 
gravitational potential V = —G mM /r. This can be verified as in Example 1. Notice 
that F is directed inward toward the origin. Also, the level surfaces of V are spheres. 
The gradient vector field F is normal to these spheres, which confirms the result of 
Theorem 14. A 


Find a unit vector normal to the surface S given by z = x?y? + y + 1 at the point 
(0,0, 1). 


Let f(x, y, Z) = x?y?+y+1 —z, and consider the level surface defined by f(x, y, z) = 
0. Because this is the set of points (x, y, z) with z = x*y*+ y + 1, we see that this level 
set coincides with the surface S. The gradient is given by 

of. af. af 


22 sAi oe aa |e 2: 2 r 
E E a 2xyti + (2x*y + 1)j — k, 


and so 
vf(0,0,1)=j-—k. 


This vector is perpendicular to S at (0, 0, 1), and so to find a unit normal n we divide 
this vector by its length to obtain 


Vf (0, 0, 1) 1 


"= ivf(0,0, DI ~ 


k). 


= ll n 
Consider two conductors, one charged positively and the other negatively. Between 
them, an electric potential is set up. This potential is a function ¢: R? —> R (an example 
of ascalar field). Theelectric field is given by E = — Vø. F rom Theorem 14 we know that 


E is perpendicular to level surfaces of @. These level surfaces are called equipotential 
surfaces, because the potential is constant on them (see Figure 2.6.6). 


Lines of constant @ 
t 

J2 
\ ( uf Pa 


figure 2.6.6 Equipotential surfaces (the dotted 


lines) are orthogonal to the electric force field E. 
Lower 


otential 


— A 
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exercises 


1. 


Show that the directional derivative of 
f(x, y, z) = z2x + y3 at (1, 1, 2) in the direction 


(1//5)i + (2//5)j is 2/5. 


» Compute the directional derivatives of the following 


functions at the indicated points in the given 

directions: 

(a) f(x, y) =x + 2xy — 3y?, (Xo, yo) = (1, 2), 
v= ži + zj 

(b) f(x, y) = log y/x? + y?, (Xo, Yo) = (1, 0), 
v = (1/75)(2i +j) 

(c) f(x, y) = e* cos(zy), (Xo, Yo) = (0, —1), 
v= —(1//5)i + (2/V5)j 

(d) f(x, y) = xy? + x3y, (xo, yo) = (4, —2), 
v= (1/V10)i + (3/10)j 


. Compute the directional derivatives of the following 


functions along unit vectors at the indicated points in 
directions parallel to the given vector: 
(a) f(x, y) = x”, (xo, yo) = (e, e), d = 5i + 12j 
(b) f(x, y,z) = e* + yz, (Xo, Yo, Zo) = (1, 1, 1), 

d = (1, —1, 1) 
(c) f(x,y,z) = xyz, (Xo, yo, Zo) = (1, 0, 1), 

d = (1, 0, —1) 


» You are walking on the graph of f(x, y) = 


y cos(zx) — x cos(zry) + 10, standing at the point (2, 1, 
13). Find an x, y-direction you should walk in to stay at 
the same level. 


. (a) Let f:R? > R, xo € R3. If vis a unit vector in 


R3, show that the maximum value of the directional 
derivative of f at xo along vis ||V f (xo) |]. 

(b) Let f(x, y, z) = x3 — y3 + z3. Find the maximum 
value for the directional derivative of f at the point 
U-23): 


» Find a vector which is normal to the curve 


x3 + xy + y? = 11 at (1, 2). 


. Find the rate of change of f(x, y, z) = xyz in the 


direction normal to the surface yx? + xy? + yz? = 3 at 
(1, 1, 1). 


» Find the planes tangent to the following surfaces at the 


indicated points: 


(a) x? + 2y2 + 3xz = 10, atthe point (1, 2, 4) 
3 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


(b) y? — x? = 3, at the point (1, 2, 8) 
(c) xyz = 1, atthe point (1, 1, 1) 


. Find the equation for the plane tangent to each surface 


z = f(x, y) at the indicated point: 


(a) z= x3 + y3 — 6xy, at the point (1, 2, —3) 
(b) z=(cosx)(cosy), at the point (0, 2/2, 0) 
(c) z = (cosx)(sin y), at the point (0, 7/2, 1) 


Compute the gradient Vf for each of the following 
functions: 


(a) f(x,y,z) =1/./x2 + y2 + 22 


(b) f(x,y,z) =xy +yz +xz 


f(x, y, zZ) = =—— 5> 
(c) f(x,y,z) Fae 
For the functions in Exercise 10, what is the direction of 
fastest increase at (1, 1, 1)? [The solution to part (c) 
only is in the Study Guide to this text.] 


Show that a unit normal to the surface 
x3y3 +y — z +2 = 0 at (0, 0, 2) is given by 
n= (1//2)(j — k). 


Find a unit normal to the surface cos(xy) = e? — 2 at 
(1, x, 0). 


Verify Theorems 13 and 14 for 
f(x, y, Z) =x2+y2422. 


Show that the definition following Theorem 14 yields, 
as a special case, the formula for the plane tangent to the 
graph of f(x, y) by regarding the graph as a level 
surface of F (x, y, z) = f(x, y) — z (see Section 2.3). 


Let f(x, y) = —(1 — x? — y?) 1/2 for (x, y) such that 
x? + y? < 1. Show that the plane tangent to the graph 
of f at (Xo, yo, f (Xo, Yo)) is orthogonal to the vector 
with components (Xo, Yo, f (Xo, Yo)). Interpret this 
geometrically. 


For the following functions f: R? —> R and 

g: R —> R3, find Vf and g' and evaluate ( f o g)’(1). 
(a) f(x,y,z) =xz+yz+xy, g(t) = (et, cost, sint) 
(b) f(x, y, z) = eX¥, g(t) = (6t, 3t2, t3) 


(c) f(x, y, Z) = (X2 + y? + z2) log \/x2 + y? 4-22, 


g(t) = (et, e+, t) 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


Compute the directional derivative of f in the given 
directions v at the given points P. 


(a) f(x, y, Z) = Xy? + y2z3 + 23x, P = 
(4, —2, -1), v = 1/V14(i + 3j + 2k) 
(b) f(x,y,z) =x, P = (e, e, 0), v= i+ j+ 4k 


You are standing on the graph of 
f(x, y) = 100 — 2x? — 3y? at the point (2, 3, 65). 


(a) What are the xy coordinates of the highest point on 
the graph? 

(b) Show that the gradient of f is the zero vector at 
the point found in (a). 


Find the two points on the hyperboloid 
x2 + 4y? — z? = 4, where the tangent plane is parallel 
to the plane 2x + 2y +z=5. 


Letr = xi + yj +zk andr = |[r||. Prove that 


y I\_ r 

OQ- 

Captain Ralph is in trouble near the sunny side of 
Mercury. The temperature of the ship’s hull when he 
is at location (x, y z) will be given by 

T(x, y, z) = e™%*=2Y?°-3z° where x, y, and z are 
measured in meters. He is currently at (1, 1, 1). 


(a) In what direction should he proceed in order to 
decrease the temperature most rapidly? 


If the ship travels at e8 meters per second, how fast 
will be the temperature decrease if he proceeds in 
that direction? 


Unfortunately, the metal of the hull will crack if 
cooled at a rate greater than ./14e2 degrees per 
second. Describe the set of possible directions in 
which he may proceed to bring the temperature 
down at no more than that rate. 


(b 


(c 


A function f: R? — R is said to be independent of the 
second variable if there is a function g: R + R such 
that f(x, y) = g(x) for all x in R. In this case, calculate 
Vf in terms of g’. 


Let f and g be functions from R? to R. Suppose f is 
differentiable and Vf (x) = g(x)x. Show that spheres 
centered at the origin are contained in the level sets for 
f; that is, f is constant on such spheres. 


A function f: R” — Ris called an even function if 
f(x) = f(—x) for every xin R". If f is differentiable 
and even, find D f at the origin. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 
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Suppose that a mountain has the shape of an elliptic 
paraboloid z = c — ax? — by?, where a, b, and c are 
positive constants, x and y are the east-west and 
north-south map coordinates, and z is the altitude above 
sea level (x, y, z are all measured in meters). At the 
point (1, 1), in what direction is the altitude increasing 
most rapidly? If a marble were released at (1, 1), in what 
direction would it begin to roll? 


An engineer wishes to build a railroad up the mountain 
of Exercise 26. Straight up the mountain is much too 
steep for the power of the engines. At the point (1, 1), in 
what directions may the track be laid so that it will be 
climbing with a 3% grade— that is, an 

angle whose tangent is 0.03? (There are two 
possibilities.) M ake a sketch of the situation indicating 
the two possible directions for a 3% grade at (1, 1). 


In electrostatics, the force P of attraction between two 
particles of opposite charge is given by P = k(r/lirI|?) 
(Coulomb’s law), where k is a constant and 

r = Xİ + yj + zk. Show that P is the gradient of 

f = —k/lir|]. 


The electrostatic potential V due to two infinite parallel 
filaments with linear charge densities 4 and —a is 

V = (à/2xe0) In(r2/r1), wherer? = (x — xo)? + y? 
andr? = (x + xo)? + y?. We think of the filaments as 
being in the z-direction, passing through the xy plane at 
(—xo, 0) and (xo, 0). Find VV (x, y). 


For each of the following, find the maximum and 
minimum values attained by the function f along the 
path c(t): 


(a) f(x, y) =xy;e(t) = (cost, sint);0<t < 2x 


(b) f(x, y) =x? + y?; clt) = (cost, 2sint);0 < t < 
2r 


Suppose that a particle is ejected from the surface 

x2 + y? — z? = —1 at the point (1, 1, V3) along the 
normal directed toward the xy plane to the surface at 
timet = 0 with a speed of 10 units per second. W hen 
and where does it cross the xy plane? 


Let f: R? > Rand regard Df (x, y, z) as a linear map 
of R? to R. Show that the kernel (that is, the set of 
vectors mapped to zero) of Df is the plane in R3 
orthogonal to Vf. 
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Differentiation 


review exercises for chapter 2 


1. 


10. 


11. 


Describe the graphs of: 


(a) f(x, y) = 3x2 + y? 
(b) f(x, y) =xy + 3x 


ı Describe some appropriate level surfaces and sections of 


the graphs of: 

(a) f(x, y, z) = 2x2 + y? +z? 
(b) f(x,y,z) =x? 

(c) f(x, y,z) = xyz 


. Compute the derivative Df (x) of each of the following 


functions: 

(a) f(x, y) = (xy, e-*Y) 
(b) f(x) =(x, x) 

(c) f(x,y,z) =e% +e” +e 
(d) f(x, y, 2) = (x,y,z) 


» Suppose f(x, y) = f(y, x) forall (x, y). Prove that 


(əf/əx)(a, b) = (əf /ay)(b, a). 


» Let f (u, v) = (cosu, v + sinu) and 


g(x, y, z) = (x? + ay, xz). Compute D ( f o g) at 
(0, 1, 1) using the chain rule. 


» Use the chain rule to find D ( f o g)(—2, 1) for 


f(u, v, w) = (v2? +uw, u2 + w2, u2v — w3) and 
g(x, y) = (xy, x? — y?, 3x + 5y). 


. Use the chain rule to find D ( f o g)(—1, 2) for 


f(u, v, w) = (v2 + w2, u3 — vw, u2v +w) and 
g(x, y) = (3x + 2y, x3y, y? — x2). 


» Let f(x, y) = (Xy, $, x + y) and 


g(w,s, t) = (we5, se"t), Find D (f og)(3, 1, 0). 


» Letr(t) = (tcos(zt), t sin(zt), t) be a path. Where will 


the tangent line to r att = 5 intersect the xy plane? 


Let f(x, y) =x2e-*Y, 


(a) Find a vector normal to the graph of f at(1, 2). 


(b) Find the equation of the tangent plane to the graph 
of f at(1, 2). 


(c) What point on the surface given by z = x? — y? has 
a tangent plane parallel to the plane found in (b)? 


Let f(x, y) = (1 — x? — y2?) 1/2, Show that the plane 
tangent to the graph of f at (xo, Yo, f (Xo, Yo)) is 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


orthogonal to the vector (Xo, yo, f (Xo, Yo)). Interpret 
geometrically. 


Let F (u, v) andu = h(x, y, z), v = k(x, y, z) be given 
(differentiable) real-valued functions and let f (x, y, z) 
be defined by f(x, y, z) = F (h(x, y, z), k(x, y, Z)). 
Write a formula for the gradient of f in terms of the 
partial derivatives of F , h, and k. 


Find an equation for the tangent plane of the graph of f 
at the point (xo, yo, f (xo, yo)) for: 
(a) f: R? —> R, (x,y) x—y+2, 
(xo, yo) = (1, 1) 
(b) f: R? > R, (x, y) > x? + 4y2, 
(xo, Yo) = (2, —1) 
(c) f: R? > R, (x, y) xy, 
(Xo, Yo) = (—1, —1) 
(d) f(x, y) = log(x +y) +x cosy + arctan(x + y), 
(xo, Yo) = (1, 0) 
(e) f(x,y) =x? +y? (xo, yo) = (1, 1) 


(f) f(x, y) =xy, (Xo, Yo) = (2, 1) 


Compute an equation for the tangent planes of the 
following surfaces at the indicated points. 


(a) x? +y? +z? =3, (1,1,1) 
(b) x3 — 2y3 +273 =0, (1, 1,1) 
(c) (cosx)(cosy)e*=0,  (x/2,1, 0) 
(d) eXY2 =1, (1, 1, 0) 


Draw some level curves for the following functions: 


(a) f(x,y) =1/xy 
(b) f(x, y) =x? — xy- y? 


Consider a temperature function T (x, y) = x sin y. Plot 
a few level curves. Compute VT and explain its meaning. 


Find the following limits if they exist: 


(a) limit(x, y (0,0) S= 


(b) limit, yy +(0,0) VI + y) x = LX # Y 
Compute the first partial derivatives and gradients of the 
following functions: 

(a) f(x, y, Z) = xe? + y cosx 

(b) f(x,y, z) = (x+y +2z)?° 

(c) f(x, y, Z) = (x? +y)/z 


19. 


20. 


21. 


22. 


23. 


Compute Ži exp(1 +x? + y?)] 


Let f: R?  R* and g: R? > R? be given by 
f(x, z) = (x? — y2, 0, sin(xy), 1) and 
g(x, y) = (ye*’, xeY’). Compute D ( f o g)(1, 2). 


Let f(x, y) = (x2 + y2)e-(*°+¥"+10) Find the rate of 
change of f at (2, 1) in the direction pointing toward the 
origin. 


Let y(x) bea differentiable function defined implicitly 
by F (x, y(x)) = 0. From Exercise 19(a), Section 2.5, we 
know that 


dy aF /ax 


dx dF /ay’ 


Consider the surface z = F (x, y), and suppose F is 
increasing as a function of x and as a function of y; that 
is, 0F /ax > 0 and dF /dy > 0. By considering the 
graph and the plane z = 0, show that for z fixed at z = 0, 
y should decrease as x increases and x should decrease 
as y increases. Does this agree with the minus sign in the 
formula for dy/dx? 


(a) Consider the graph of a function f(x, y) [Figure 
2.R.1(a)]. Let (Xo, yo) lie on a level curve C , so 
Vf (Xo, Yo) iS perpendicular to this curve. Show that 
the tangent plane of the graph is the plane that (i) 
contains the line perpendicular to Vf (Xo, yo) and 
lying in the horizontal planez = f (Xo, yo), and (ii) 
has slope || Vf (Xo, Yo) || relative to the xy plane. 


N 


Level curve 
raised to graph 


y 


lao yo) 


(a) 


Slope of tangent plane = ||Vt || 


24. 


25. 


26. 


27. 


28. 
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(By the slope of a plane P relative to the xy plane 
we mean the tangent of the angle 8, 0 < 0 < x, 
between the upward-pointing normal p to P and the 
unit vector k.) 


(b) Use this method to show that the tangent plane of 
the graph of f(x, y) = (x + cos y)x? at (1, 0, 2) is 
as sketched in Figure 2.R.1(b). 

Find the plane tangent to the surface z = x2 + y? at the 


point (1, —2, 5). Explain the geometric significance, for 
this surface, of the gradient of f(x, y) = x? + y? (see 
Exercise 23). 


In which direction is the directional derivative of 
f(x, y) = (x2 — y?) /(x? + y?) at (1, 1) equal to 
zero? 


Find the directional derivative of the given function at 
the given point and in the direction of the given vector. 


(a) f(x, y, z) = e* cos( yz), po = (0, 0, 0), 
v= (2,1, —2) 

(b) f(x,y,z) = xy +yz+2x, Po = (1,1, 2), 
v = (10, —1, 2) 


Find the tangent plane and normal to the hyperboloid 
x2 + y? — z? = 18 at (3, 5, —4). 


Let (x(t), y(t)) be a path in the plane, 0 < t < 1, and let 
f(x, y) beaC 1 function of two variables. Assume that 
(dx /dt) fy +(dy/dt) fy < 0. Show that f(x(1), y(1)) < 
f(x(0), y(0)). 


N 


figure 2.R.1 (a) The relationship between the gradient of a function and the tangent plane to 
the graph (Exercise 23(a)). For a specific instance of the tangent plane in (b) see Exercise 23(b). 
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29. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


Differentiation 


A bug finds itself in a toxic environment. The toxicity 
level is given by T (x, y) = 2x2 — 4y2. The bug is at 
(—1, 2). In what direction should it move to lower the 
toxicity the fastest? 


Find the direction in which the function w = x? + Xy 
increases most rapidly at the point (—1, 1). What is the 
magnitude of Vw atthis point? Interpret this magnitude 
geometrically. 


Let f be defined on an open set S in R". We say that f is 
homogeneous of degree p over S if f (Ax) = A? f (x) for 
every real à and for every xin S for which àx e S. 


(a) If such a function is differentiable at x, show that 
X- Vf (x) = pf (x). This is known as Euler’s 
theorem for homogeneous functions. [HINT: For 
fixed x, define g(A) = f (Ax) and compute g’(1).] 

(b) Find p and check Euler’s theorem for the function 
f(x, y, Z) = x — 2y —../xz, on the region where 
Xz >0. 


If z = [f (x — y)]/y (where f is differentiable and 
y Æ 0), show that the identity 
z + y(dz/dx) + y(dz/ay) = 0 holds. 


Given z = f((x + y)/(x — y)) for f aC! function, 
show that 


Let f have partial derivatives of (x) /ax;, where 

i =1,2,...,n, ateach point x of an open setU in R". If 
f has a local maximum or a local minimum at the point 
Xo in U , show that af (x9) /ax; = 0 for each i. 


Consider the functions defined in R? by the following 
formulas: 


(i) f(x,y) =xy/(x? +y?) if (x, y) # (0, 0), 
f(0,0) —0 

(ii), f(x, y) = x2y2/(x2 + y4) if (x, y) 4 (0, 0), 
f(0,0) —0 


(a) In each case, show that the partial derivatives 
af (x, y)/ax and af (x, y)/ay exist for every (x, y) 
in R2, and evaluate these derivatives explicitly in 
terms of x and y. 


(b) Explain why the functions described in (i) and (ii) 
are or are not differentiable at (0, 0). 


Compute the gradient vector Vf (x, y) atall points (x, y) 
in R2 for each of the following functions: 


(a) f(x, y) =x2y*log(x2+y?) if 
(x, y) Æ (0, 0), f(0,0) = 0 


37. 


38. 


39. 


4l. 


42. 


(b) f(x, y) = xy sin[1/(x? + y?)] if 
(x, y) 4 (0, 0), f(0,0) =0 


Find the directional derivatives of the following 
functions at the point (1, 1) in the direction (i+ j) //2: 


(a) f(x, y) =x tan-?(x/y) 


(b) f(x, y) = cos(4/x?2 + y?) 


(c) f(x, y) = exp(—x? — y?) 


(a) Letu =i -— 2j + 2k and v = 2i + j — 3k. Find: 
lulj, u-v, u x v, and a vector in the same direction 
as u, but of unit length. 


Find the rate of change of e*Y sin(xyz) in the 
direction u at (0, 1, 1). 


(b 


Let h(x, y) =2e-* + e-3Y° denote the height on a 
mountain at position (x, y). In what direction from (1, 0) 
should one begin walking in order to climb the fastest? 


. Compute an equation for the plane tangent to the graph 


of 


x 


e 
f(x,y) = Gag 


ax=l y=2. 


(a) Give a careful statement of the general form of the 
chain rule. 

(b) Let f(x, y) =x? + y and h(u) = (sin 3u, cos 8u). 
Let g(u) = f(h(u)). Compute dg/du atu = 0 both 
directly and by using the chain rule. 


(a) Sketch the level curves of f(x, y) = —x? — 9y? for 
c = 0, —1, —10. 
(b) On your sketch, draw in Vf at (1, 1). Discuss. 


. Attimet = 0, a particle is ejected from the surface 


x? + 2y? + 3z? = 6 at the point (1, 1, 1) in a direction 
normal to the surface at a speed of 10 units per second. 
At what time does it cross the sphere 

x? + y? +z? = 103? 


. At what point(s) on the surface in Exercise 43 is the 


normal vector parallel to the line x = y = z? 


» Compute dz/ax and əz/əy if 


z= u=e*-Y, v= ery 


Ba 


(a) by substitution and direct calculation, and (b) by the 
chain rule. 


46. 


47. 


48. 


49, 


50. 
51. 


52. 


53. 


54. 


55. 


56. 


Compute the partial derivatives as in Exercise 45 if 
Z=Uv,U=X+y,andv=x—y. 


W hat is wrong with the following argument? Suppose 
that w = f(x, y) and y = x2. By the chain rule, 


ðw OW Ax 
əx ax Ox 


aw ay aw aw 
ay ax ax ay” 


Hence, 0 = 2x(dw /ay), and so aw /ay = 0. Choose an 
explicit example to really see that this is incorrect. 


A boat is sailing northeast at 20 km/h. Assuming that the 
temperature drops at a rate of 0.2°C/km in the northerly 
direction and 0.3°C/km in the easterly direction, what is 
the time rate of change of temperature as observed on the 
boat? 


Use the chain rule to find a formula for 
(d /dt) exp[ f(t)g(t)]. 


Use the chain rule to find a formula for (d /dt)( f (t)9"")), 


Verify the chain rule for the function 
f(x, y, z) =[In(1 + x? + 2z2)]/(1 + y?) and the path 
c(t) = (t, 1—t?, cost). 


Verify the chain rule for the function 
f(x, y) =x2/(2 + cosy) and the path x =et, y = e~. 


Suppose that u(x, t) satisfies the differential equation 
ut + uux = 0 and that x, as a function x = f(t) of t, 
satisfies dx/dt = u(x, t). Prove that u( f (t), t) is 
constant in t. 


The displacement at time t and horizontal position on a 
line x of a certain violin string is given by 

u = sin(x — 6t) + sin (x + 6t). Calculate the velocity of 
the string at x = 1 whent = §. 


The ideal gaslaw PV =nRT involves aconstant R, the 
number n of moles of the gas, the volume V , the Kelvin 
temperature T , and the pressure P . 


(a) Show that each of n, P, T, V isa function of the 
remaining variables, and determine explicitly the 
defining equations. 

(b) Calculate əV /dT, aT /aP, aP /aV and show that 
their product equals —1. 


The potential temperature 0 is defined in terms of 
temperature T and pressure p by 


0.286 
Ton 
p 


57. 


58. 


59. 
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The temperature and pressure may be thought of as 
functions of position (x, y, z) in the atmosphere and also 
of timet. 


(a) Find formulas for 06/ax, 00/dy, 06/dz, 30 /əƏt in 
terms of partial derivatives of T and p. 


The condition 06/az < 0 is regarded as an 
unstable atmosphere, for it leads to large vertical 
excursions of air parcels from a single upward or 
downward impetus. M eteorologists use the 


formula 
06 o (aT 2 
əz T (əz Cy) 


where g = 32.2 and Cy is a positive constant. H ow 
does the temperature change in the upward direction 
for an unstable atmosphere? 


(b 


The specific volume V , pressure P , and temperature T 
of a van der Waals gas are related by 

P =RT/(V — 8) —a/V2, where œ, 8, and R are 
constants. 


(a) Explain why any two of V, P, and T can be 
considered independent variables that determine the 
third variable. 

(b) Find oT /aP, oP /aV, ƏV /aT . Identify which 
variables are constant, and interpret each partial 
derivative physically. 

(c) Verify that 
(oT /aP )(aP /aV )(aV /aT ) = —1( not +1!). 


The height h of the Hawaiian volcano M auna Loa is 
(roughly) described by the function 

h(x, y) = 2.59 — 0.00024y? — 0.00065x2, where 

h is the height above sea level in miles and x and y 
measure east-west and north-south distances in 
miles from the top of the mountain. At (x, y) = 
(—2, —4): 


(a) How fast is the height increasing in the direction 
(1, 1) (that is, northeastward)? Express your answer 
in miles of height per mile of horizontal distance 
traveled. 


(b) In what direction is the steepest upward path? 

(a) In what direction is the directional derivative of 
F(x, y) = (x? — y*)/(x? + y?) at (1, 1) 
equal to zero? 

(b) How about at an arbitrary point (Xo, yo) in the first 
quadrant? 

(c) Describe the level curves of f. In particular, discuss 


them in terms of the result of part (b). 
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Differentiation 


60. (a) Show that the curve x? — y? = c, for any value 


Show that u satisfies a (partial) differential equation of 


of c, satisfies the differential equation the form 
le a x2 ou 2 _ G(x, yu 

(b) Draw in afew of the curves x? — y? = c, say for ax ay 
c = +1. At several points (x, y) along each of these and find the function G (x, y). 
curves, draw a short segment of slope x /y; check 
that these segments appear to be tangent to the 62. (a) LetF beafunction of one variable and f a function 
curve. What happens when y = 0? What happens of two variables. Show that the gradient vector of 
when c = 0? g(x, y) = F ( f (x, y)) is parallel to the gradient 

vector of f(x, y). 
61. Suppose that f is a differentiable function of one (b) Let f(x, y) and g(x, y) be functions such that 


variable and that a function u = g(x, y) is defined by 


Vf = Avg for some function A(x, y). Whatis the 
relation between the level curves of f and g? 
X+ 1) Explain why there might be a function F such that 


u=g(x,y)=xyf ( xy g(x, y) =F (f(x, y)). 


3 


Higher-Order Derivatives: 
Maxima and Minima 


All that is superfluous displeases God and Nature. 


All that displeases God and Nature is evil. —Dante Alighieri, circa 1300 


... namely, because the shape of the whole universe is most perfect, and, in fact, designed 
by the wisest creator, nothing in all of the world will occur in which no maximum or minimum 


rule is somehow shining forth. —Leonhard Euler 


Leonhard Euler 


(by Emanuel à n ” 
n In one-variable calculus, to test a function f(x) for a local maximum 
(1707-1783). or minimum, we often use the second derivative. We look for critical 


points Xy—that is, points Xp for which f'(xo) =0, and at each such point we 
check the sign of the second derivative f”(xo). If f’(x%) <0, F(X) is a lo- 
cal maximum of f; if f’06) > 0, T0) is alocal minimum of f; if f’(x%>) =0, 
the test fails. 

This chapter extends these methods to real-valued functions of sev- 
eral variables. We begin in Section 3.1 with a discussion of iterated and 
higher-order partial derivatives, and in Section 3.2 we discuss the multi- 
variable form of Taylor's theorem; this is then used in Section 3.3 to derive 
tests for maxima, minima, and saddle points. As with functions of one 
variable, such methods help us to visualize the shape of a graph. 

In Section 3.4, we study the problem of maximizing a real-valued func- 
tion subject to supplementary conditions, also referred to as constraints. 
For example, we might wish to maximize f(x, y, 2 among those (x, y, 2) 
constrained to lie on the unit sphere, x?+ y?+z? = 1. Section 3.5 discusses 
a technical theorem (the implicit function theorem) useful for studying 


constraints. It will also be useful later in our study of surfaces. 
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3.1 Iterated Partial Derivatives 


example | 


solution 


example 2 


solution 


The preceding chapter developed considerable information concerning the derivative 
of a map and investigated the geometry associated with the derivative of real-valued 
functions by making use of the gradient. In this section, we proceed to study higher- 
order derivatives, with the goal of proving the equality of the “mixed second partial 
derivatives” of a function. We begin by defining the necessary terms. 


Likewise, if we say 
f is of class C3, we mean f has continuous iterated partial derivatives of third order, 
and so on. Here are a few examples of how second-order derivatives are written: 


vf a (=) vf a (=) Jj = >(£) a 
ax2 ax \ax/' = ax dys ax Lay J’  əzəy  əz\əy/' 
The process can, of course, be repeated for third-order derivatives, and so on. If f is 


a function of only x and y and af /dx, əf /ay are continuously differentiable, then by 
taking second partial derivatives, we get the four functions 


af af af af 
i f , and : 
3x? əy? ax Oy ay ax 


All of these are called iterated partial derivatives, while af 73x ay and a*f /ay ax are 
called mixed partial derivatives. 


Find all second partial derivatives of f(x, y) = xy +(x + 2y)?. 


The first partials are 


af of 
ae al (x + 2y) ay xX + 4(x + 2y) 
Now differentiate each of these expressions with respect to x and y: 
2 2 
a ag wh os 
Ox? 3y? 
f f 
axðəy `  aəyəðx A: 
Find all second partial derivatives of f (x, y) = sin x sin? y. 
We proceed just as in Example 1: 
a eee el Banyan cosy = sin x sin 2y; 
TA y, əy y cos y = y; 
af dence af 
a7 en y, Ta 
af af 


= cosx sin 2y, = 2 cosx sin y cos y = cosx sin2y. A 


ax ay dy ax 


example 3 
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Let f(x, y, z) = e" + zcosx. Then 


of of 

ay = Ye —28inx, oe = COSX, 
cal = —sinx is = —sinx etc 
əzəx '  ƏxƏZ ' ` A 


The Mixed Partials Are Equal 


In all these examples note that the pairs of mixed partial derivatives, such as df /ax ay 
and a?f /ay ax, or ə?f /Əz ax and df /ax dz, are equal. It is a basic and perhaps sur- 
prising fact that this is always the case for C ? functions. We shall prove this in the next 
theorem for functions f(x, y) of two variables, but the proof can be readily extended 
to functions of n variables. 


Theorem 1 If f(x, y) is of class C2 (is twice 
continuously differentiable), then the mixed partial derivatives are equal; that is, 
f əf 
axdy ay ax’ 


proof Consider the following expression (see Figure 3.1.1): 


S(Ax, Ay) = f (Xo + AX, Yo + Ay) — f (Xo + AX, Yo) 
— f (Xo, Yo + Ay) + f (Xo, Yo). 


Holding yo and Ay fixed, define 


g(x) = F(X, Yo + Ay) — f (xX, Yo), 


so that S( Ax, Ay) = g(Xo + AX) — g(Xo), which expresses S as a difference of differ- 
ences. By the mean-value theorem for functions of one variable, g(Xo + Ax) — g(Xo) 
equals g’(x) Ax for some x between Xo and Xp + Ax. Hence, 


of f 
S(Ax, Ay) = | —(X, Yo + Ay) — —(X, Yo) | Ax. 
y aX Yo y ax Yo 
y 
(Xo: Yo + Ay) (Xo + AX, Yo + Ay) 

C 7 + |P figure 3.1.1 The algebra behind the 
equality of mixed partials: writing the 
difference of differences in two 

AY t+ -|p ways. 

(Xo, Yo) (Xo+ AX, Yo) 
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Historical Note 


example 4 


solution 


Applying the mean-value theorem again, there is a y between yo and yo + Ay such that 


2 


AX, Ay) = 
SAKAN dy OX 


(X, Y) Ax Ay. 


Because df /ay ax is continuous, it follows that 
ə?f 


ay əx [S(Ax, Ay)]. 


Xo, Yo) = limit = ——_ 
o YO a A oy A Ay 
Noting that S is symmetric in Ax and Ay, we show ina similar way that af /ax ay is 
given by the same limit formula, which proves the result. m 


The equality of mixed partial derivatives is one of the most important results of 
multivariable calculus. It will reappear on several occasions later in the book, 
when we study vector identities. 

In the next historical note, we will discuss the role of partial derivatives in the 
formulation of many of the basic equations governing physical phenomena. 
One of the giants in this era was Leonhard Euler (1707-1783), who developed the 
equations of fluid mechanics that bear his name—the Euler equations. It was in 
connection with the needs of this development that he discovered, around 1734, 
the equality of mixed partial derivatives. Euler was about 27 years old at the time. 


In Exercise 17 we ask you to deduce from Theorem 1 that for aC ? function of x, y, 
and z, 
af a3f arf 


= = i etc. 
əxəyəz dzayax dy azax 


In other words, we can compute iterated partial derivatives in any order we please. 


Verify the equality of the mixed second partial derivatives for the function 


f(x, y) = xe” + yx?. 


Here 
of af 
—=e +2xy, — = xe’ +x?2, 
ox Texy oy i 
2f 3?f 
=e’ +2x, = @ + 2x, 
ay ax T ax ay jj 
and so we have 
3? f 3?f 


ay Ox z axoy A 


Sometimes the notation fx, fy, fz is used for the partial derivatives: fx = af/ax, and 
so on. With this notation, we write fxy = ( fx)y, and so equality of the mixed partials is 
denoted by fxy = fyx. Notice that f,y = 3?f /ay ax, so the order of x and y is reversed 
in the two notations; fortunately, the equality of mixed partials makes this potential 
ambiguity irrelevant. The following example illustrates this subscript notation. 


example 5 


solution 


Historical Note 
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Let 
z= f(x, y) =e* sinxy 
and write x = g(s, t), y = h(s, t) for certain functions g and h. Let 
k(s, t) = f(g(s, t), h(s, t)). 
Calculate kst. 
By the chain rule, 
ks = fxgs + fyhs = (e* sin xy + ye” cosxy)gs + (xe* cos xy)hs. 
Differentiating in t using the product rules gives 
kst = (fy )eQs + fx(Qs)e +( fy) hs + fylhs)h. 
Applying the chain rule again to ( fx), and ( fy), gives 
(fade = fexOt + fryht and (fy), = fyxde + fyyhe, 
and so ks, becomes 


kst = ( fxx9t + frye) Os + fxOst + ( fyxOt + fyyhe)hs + fyhst 


= fxx9t9s F fry (Megs F hsQt) F fyyhths + fx Ost + fyhst. 
Notice that this last formula is symmetric in (s, t), verifying the equality Ks = kts. 
Computing fxx, fry, and fyy, we get 


II 


kst = (e* sin xy + 2ye* cosxy — y?e* sin Xy) gtgs 
+(xe* cosxy + e* cosxy — xye* sin xy)(htgs + hs9t) 
— (x?e* sin xy)hth; + (e* sin xy + ye* cos xy) gst + (xe* cos XY) Net, 


in which itis understood that x = g(s, t) and y = h(s,t). A 


Some Partial Differential Equations 


Philosophy (nature) is written in that great book which ever is before our eyes—I mean the 
universe—but we cannot understand it if we do not first learn the language and grasp the 
symbols in which it is written. The book is written in mathematical language, and the 
symbols are triangles, circles and other geometrical figures, without whose help it is 
impossible to comprehend a single word of it; without which one wanders in vain through 
a dark labyrinth. —Galileo 


This quotation illustrates the Greek belief, again popular in the time of Galileo, 
that much of nature could be described using mathematics. In the latter part of 
the seventeenth century this thinking was dramatically reinforced when Newton 
used his law of gravitation to derive Kepler's three laws of celestial motion (see 
Section 4.1) to explain the tides, and to show that the earth was flattened at the 
poles. The impact of this philosophy on mathematics was substantial, and many 
mathematicians sought to “mathematize” nature. The extent to which 
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mathematics pervades the physical sciences today (and, to an increasing 
amount, economics and the social and life sciences) is testament to the success 
of these endeavors. Correspondingly, the attempts to mathematize nature have 
often led to new mathematical discoveries. 

Many of the laws of nature were described in terms of either ordinary 
differential equations (ODEs, equations involving the derivatives of functions of 
one variable alone, such as the laws of planetary motion) or partial differential 
equations (PDEs), that is, equations involving partial derivatives of functions. To 
give you some historical perspective and offer motivation for studying partial 
derivatives, we present a brief description of three of the most famous partial 
differential equations: the heat equation, the potential equation (or Laplace’s 
equation), and the wave equation. (Further information on some PDEs is given in 
Section 8.5.) 


THE HEAT EQUATION. In the early part of the nineteenth century the French 
mathematician Joseph Fourier (1768-1830) took up the study of heat. Heat flow 
had obvious applications to both industrial and scientific problems: A better 
understanding of it would, for example, make possible more efficient smelting 
of metals and would enable scientists to determine the temperature of a body 
given the temperature at its boundary, and to approximate the temperature of 
the earth’s interior. 

Let a homogeneous body $ c R? (Figure 3.1.2) be represented by some 
region in 3-space. Let T (x, y, z, t) denote the temperature of the body at the point 
(x, y, Z) at time t. Fourier proved, on the basis of physical principles (described in 
Section 8.5), that T must satisfy the partial differential equation called the heat 
equation, 


e | a?T | oar 1) 
ax2 3y? əz? at 
where kis a constant whose value depends on the conductivity of the material 
comprising the body. 

Fourier used this equation to solve problems in heat conduction. In fact, his 
investigations into the solutions of equation (1) led him to the discovery of Fourier 
series. 


X 


figure 3.1.2 A homogeneous body in space. 
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THE POTENTIAL EQUATION. Consider the gravitational potential V (often called 
Newton’s potential) of a mass mat a point (x, y, zZ) caused by a point mass M 
situated at the origin. This potential is given by V = —GmM/r, where 

x2 + y2 + z2. The potential V satisfies the equation 


PV PV PV ] 
a, (2) 
axe aye az? 


everywhere except at the origin, as we will check in the next chapter (see also 
Exercise 25). This equation is known as Laplace’s equation. Pierre-Simon de 
Laplace (1749-1827) had worked on the gravitational attraction of nonpoint 
masses and was the first to consider equation (2) with regard to gravitational 
attraction. He gave arguments (later shown to be incorrect) that equation (2) 
held for any body and any point whether inside or outside that body. However, 
Laplace was not the first person to write down equation (2). The potential 
equation appeared for the first time in one of Euler’s major papers in 1752, 
“Principles of the Motions of Fluids,” in which he derived the potential equation 
with regard to the motion of (incompressible) fluids. Euler remarked that he had 
no idea how fo solve equation (2). Poisson later showed that if (x, y, z) lies inside 
an attracting body, then V satisfies the equation 


ov ev oy 
axe ` aye ` əz? 


= —Azp, (3) 


where p is the mass density of the attracting body. Equation (3) is now called 
Poisson’s equation. Poisson was also the first to point out the importance 

of this equation for problems involving electric fields. Notice that if the 
temperature T is constant in time, then the heat equation (1) reduces to 
Laplace’s equation (2). 

Laplace’s and Poisson’s equations are fundamental to many fields besides 
fluid mechanics, gravitational fields, and electrostatic fields. For example, they 
are useful for studying soap films and liquid crystals (see The Parsimonious 
Universe: Shape and Form in the Natural World by S. Hildebrandt and A. Tromba, 
Springer-Verlag, New York/Berlin, 1995). 


THE WAVE EQUATION. The linear wave equation in space has the form 


of, Of Of ott 


axe" aye” az af’ (4) 

The one-dimensional wave equation 
af got i 
= (4’) 


ax © af 


was derived in about 1727 by Johann II Bernoulli and several years later by Jean 
Le Rond d’Alembert in the study of how to determine the motion of a vibrating 
string (such as a violin string). Equation (4) became useful in the study of both 
vibrating bodies and elasticity. As we shall see when we consider Maxwell’s 
equations for electromagnetism in Section 8.5, this equation also arises in the 
study of the propagation of electromagnetic radiation and sound waves. 
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example 6 | As we have pointed out, the heat equation, originating around 1800, is one of the 
important and classic partial differential equations. It describes the conduction of heat 
in a solid body. For example, understanding the dissipation of heat is important for 
industry, as well as for scientists understanding heat stresses on a capsule reentering the 
( ) earth’s atmosphere. 

N y J Consider a thin rod of length | (Figure 3.1.3). 


1 We now show that 
figure 3.1.3 A thin rod. 


1 


u(x, t) = oo 
is asolution of the heat equation 
du a’u 
at əx? 
solution By the chain rule 
ou 2 1 e-X?/4t 1 erat A es 
at 2t3/2 t1/2 dt \ 4t 
ee ee X? eat 
~ 23⁄2 t1⁄2 At2 
1 X 2 
= fa ex (at 
ssal T x) 
whereas 
ðU XO kt 
ax 2t3/2 
a7u 1 MP ug 
fete —x*/4t > aX /4t 
ae A tae 
ou 
at” 


This solution is called a fundamental solution to the heat equation. A 


exercises 


In Exercises 1 to 6, compute the second partial derivatives a2f /ax?, a2f /ax ay, d2f /ay ax, Ə?f /ay2 for each of the 
following functions. Verify Theorem 1 in each case. 


L f(x,y) = 2xy/(x2 + y?)?, on the region where 6. f(x, y) = log(x — y) 
(x, y) # (0, 0) ee . 
7. Find all second partial derivatives of the following 
2 f(x, y, z) =e +(1/x) + xe-Y, on the region where functions at the point xo. 
x 40 


(a) f(x, y) = sin(xy); X = (x, 1) 
3. f(x, y)= cos (xy?) (b) f(x,y) = xy8 + x? + yf; X = (2, —1) 


(c) f(x, y,z) = e*!7; xX = (0, 0, 0) 
4 f(x,y) = e7% + y3x4 
ue , 8. Find all second partial derivatives of 
5. f(x, y) = 1/(cos? x + e7”) f(x, y) = sec3(4y — 3x). 


9. Can there exista C? function f(x, y) with 
fx = 2x — 5y and fy = 4x + y? 


10. The heat conduction equation is ut = kuxx. Determine 
whether u(x, t) = e7kt sin(x) is a solution. 


IL Show that the following functions satisfy the 
one-dimensional wave equation 


f 61 att 
ax2 C2 ata” 
(a) f(x, t) = sin(x — ct) 
(b) f(x, t) = sin(x) sin(ct) 


(c) f(x, t) = (x —ct)® +(x + ct)ê 
I2. (a) Show that T (x, t) = e-*t cosx satisfies the 
one-dimensional heat equation 
eT ƏT 
ax? at’ 
Show that T (x, y, t) = e~t (cosx + cos y) satisfies 
the two-dimensional heat equation 
(5 a°T ) _ aT 
əx?  ay2 J at 


(b 


Show that T (x, y, z, t) = e-*t(cosx + 
cos y + cosz) satisfies the three-dimensional heat 
equation 


(c 


(5 at aT ) 0T 

əx? əy? az2 J ət’ 

I3. Find 3?z/əx?, a2z/ax ay, 02z/ay ax, and a2z/ay2 for 
(a) z = 3x? + 2y? 


(b) z= (2x2? + 7x2y) /3xy, on the region where x 4 0 
and y 40 


14. Find all the second partial derivatives of 
(a) z = sin (x? — 3xy) 
(b) z= x?y?e2*¥ 
15. Find fxy, fyz, fzx, and fxyz for 
f(x, y, Z) = xy + xy? + yz”, 
16. Letz = xty? — xê + y4. 


(a) Compute a3z/ay ax ax, 03z/ax ay ax, and 
a3z/ax ax day (also denoted a3z/ax2ay). 

(b) Compute a2z/ax ay ay, 03z/ay ax ay, and 
a2z/ay ay ax (also denoted 43z/ay2ax). 


17. Use Theorem 1 to show that if f(x, y, z) is of class C 3, 
then 
af af 
əx ay dz E əy əz OX" 
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18. Verify that 


Bf f 
ax ay az  əzəðyəX 


for f(x, y, z) = ze% + yz3x2, 


19. Verify that fxzw = frwx for 
f(x, y,Z,w) =e*¥ sin (xw). 


20. If f(x, y, z, w) is of class C 3, show that fxzw = frwx. 


2L Evaluate all first and second partial derivatives of the 
following functions: 


(a) f(x, y) = x arctan (x/y) 


(b) f(x, y) = cos 4/x2 + y2 


(c) f(x, y) = exp(—x? — y?) 


22. Letw = f(x, y) bea function of two variables and let 
X = U +v, y =u — v. Show that 


w aw aw 
ðuðv əx? əy?” 


23. Let f: R? —> R bea C? function and let qt) be a C 2 
curve in R2. Write a formula for the second derivative 
(d2/dt2)((f o @(t)) using the chain rule twice. 


24. Let f(x, y, z) =e%? tan (yz) and let x = g(s, t), 
y = h(s, t), z = k(s, t), and define the function 
m(s, t) = f (g(s, t), h(s, t), k(s, t)). Find a formula for 
Mst using the chain rule and verify that your answer is 
symmetric in s and t. 


25. A function u = f(x, y) with continuous second partial 
derivatives satisfying Laplace's equation 


ðu 3u i 
ax2 əy? 


is called a harmonic function. Show that the function 
u(x, y) = x3 — 3xy? is harmonic. 


26. Which of the following functions are harmonic? (See 
Exercise 25.) 
(a) f(x, y) =x? — y? 
(b) f(x,y) =x? +y? 
(c) f(x,y) =xy 
(d) f(x, y) = y? + 3xy 
(e) f(x, y) = sinx cosh y 
(f) f(x, y) =e* siny 
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27. (a) Is the function f(x, y, z) = x? — 2y? + z? 3L Show that N ewton’s potential V = —G mM /r satisfies 
harmonic? What about f (x, y, z) = X? + y? —z2? Laplace's equation 
(b) Laplace's equation for functions of n variables is 3V aV 3V 
+ =0 for (x,y,z) Æ (0,0,0). 
ax? əy? © az? 
at att aa a 
axe) ag OR 32. Let 
2 _ y2) (x2 4 y2 
Find an example of a function of n variables that is f(x,y) = as YRA EYA 7 : 
, X, y) =, 


harmonic, and show that your example is harmonic. 
(see Figure 3.1.4). 


(a) If (x, y) 4 (0, 0), calculate əf /ax and of /ay. 
(b) Show that (af /ax)(0, 0) = 0 = (af /ay)(0, 0). 


(c) Show that (a2f /ax ay)(0, 0) = 1, 
(9?f /ay ax)(0, 0) = —1. 


28. Show that the following functions are harmonic: 
(a) f(x, y) = arctan ¥ 
(b) f(x, y) = log(x? + y?) 


29. Let f and g be C2 functions of one variable. Set 


o = f(x —t) +(x +t). (d) What went wrong? Why are the mixed partials not 
equal? 


(a) Prove that ¢ satisfies the wave equation: 
a2 /at2 = a2 /ax2. 

(b) Sketch the graph of œ against t and x if f(x) = x? 
and g(x) = 0. 


30. (a) Show that function g(x, t) = 2 + e™ sin x satisfies 
the heat equation: gt = gxx. [Here g(x, t) 
represents the temperature in a metal rod at position 
x and timet.] 

(b) Sketch the graph of g fort > 0. (HINT: Look at 
sections by the planes t = 0, t = 1, and t = 2.) 


(c) What happens to g(x, t) ast — oo? Interpret this 
limitin terms of the behavior of heat in the rod. figure 3.1.4 The graph of the function in Exercise 32. 


3.2 Taylor’s Theorem 


W hen we introduced the derivative in Chapter 2, we saw that the linear approximation 
of a function played an essential role for a geometric reason— finding the equation of a 
tangent plane— as well as an analytic reason— finding approximate values of functions. 
Taylor’s theorem deals with the important issue of finding quadratic and higher-order 
approximations. 

Taylor's theorem is a central tool for finding accurate numerical approximations of 
functions, and as such plays animportantrolein many areas of applied and computational 
mathematics. We shall use it in the next section to develop the second derivative test for 
maxima and minima of functions of several variables. 

The strategy used to prove Taylor's theorem is to reduce it to the one-variable case by 
probing a function of many variables along lines of the form I(t) = X% + th emanating 
from a point % and heading in the direction h. Thus, it will be useful for us to begin by 
reviewing Taylor's theorem from one-variable calculus. 


Single-Variable Taylor Theorem 


W hen recalling atheorem from an earlier course, itis helpful to ask these basic questions: 
W hat is the main point of the theorem? W hat are the key ideas in the proof? Can | 
understand the result better the second time around? 
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The main point of the single-variable Taylor theorem is to find approximations of a 
function near a given point that are accurate to a higher order than the linear approxima- 
tion. The key ideain the proof is to use the fundamental theorem of calculus, followed by 
integration by parts. In fact, just by recalling these basic ideas, we can reconstruct the en- 
tire proof. Thinking in this way will help organizeall the pieces that need to come together 
to develop a mastery of Taylor approximations of functions of one and several variables. 

For a smooth function f : R — R of one variable, Taylor's theorem asserts that: 


f"(xo) 2 m f(O (xo) 


; _ i hk + Ry(Xo,h), (1) 


f (Xo +h) = f(Xo) + f’(Xo) -h + 


where 


Xoth _ k 
Rio, h) = f aaile T 


is the remainder. For small h, this remainder is small to order k in the sense that 


lim Rx (Xo, h) 


U (2) 


In other words, Ry(Xo, h) is small compared to the already small quantity h“. 
The preceding is the formal statement of Taylor’s theorem. W hat about the proof? 
As promised, we begin with the fundamental theorem of calculus, written in the form: 


Xoth 
(xp +h) = Flo) +f f(r) dr. 
Next, we write dt = —d(X9 +h — t) and integrate parts! to give: 


Xo+h 
(xo +h) = f (xo) + ooh f E ee ae 


Xo 


which is the first-order Taylor formula. Integrating by parts again: 


Xo+h 
| Mider hedi: 


x 1 Xo+h 
” -5 F(x) d(xo +h — t)? 


Lene yg 1 pe 
Z zon f f”lr)lXo +h — t)? dr, 


which, when substituted into the preceding formula, gives the second-order Taylor 
formula: 


This is Taylor's theorem for k = 2. 


1Recall that integration by parts (the product rule for the derivative read backward) reads as: 


b b 
/ udv=uot- f vdu. 
a a 


Here we choose u = f’(t) and v = Xo +h — T. 
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Taylor's theorem for general k proceeds by repeated integration by parts. T he state- 
ment (2) that Ry (Xo, h)/h* —> 0 ash — 0 is seen as follows. For z in the interval 
[Xo, Xo + h], we have [Xp +h — t| < jh], and f**1(z), being continuous, is bounded; 
say, | f*++(r)| < M. Then: 

Ta is 
< 


[Rx(Xo, h)| = fr) dri< A M 


“Th (Xo +h — t)“ 
J 


0 


and, in particular, |Rx(Xo, h)/h*| < |h| M/k! > Oash > 0. 


Taylor's Theorem for Many Variables 


Our next goal in this section is to prove an analogous theorem that is valid for functions 
of several variables. We already know a first-order version; thatis, when k = 1. Indeed, 
if f: R" > Ris differentiable at X and we define 


R(X, h) = f(% + h) — f(%) — [Df (x) ](h), 
so that 

f(% +h) = f(x) + [DF (x) ](h) + R106, h), 
then by the definition of differentiability, 


|Ri(%, h)| 
|| h| 


that is, R1(%, h) vanishes to first order at xX. In summary, we have: 


>0 as h—0; 


Theorem 2 First-Order Taylor Formula Let f:U c R" — R be differ- 
entiable at X% € U . Then 


Taah = to +Y 9 9) Ralo N 


where Rı(%, h)/|ih| —> 0 ash— Oin R’. 


The second-order version is as follows: 


Theorem 3 Second-Order Taylor Formula Let f:U c R" —> R have 
continuous partial derivatives of third order.? Then we may write 


Hoe = roa +m Soa 3 Sin EE) Rabe: h 


where R2(%, hh) /||h||2 —> 0 as h —> Oand the second sum is over all i’s and j's 
between 1 and n (so there are n2 terms). 


2For the statement of the theorem as given here, f actually needs only to be of class C ?, but for a 
convenient form of the remainder we assume f is of class C 3. 
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Notice that this result can be written in matrix form as 


hy 
of of 
f h) = f L eaaa a IN 
(%) +h) = f(%) + È z] 
hn 
ef 3f a2f 
0X1 OX) OX 1 OX2 0X1 OXn h 
1 
əf 3?f a2f h 
TURAT OX2 0X1 0X2 0X2 3X2 OXn ? 
af af af Mn 
OXn ðX OXp OX Xn OXn 
+ R2(%, h), 


where the derivatives of f are evaluated at x. 
In the course of the proof of the T heorem 3, we shall obtain a useful explicit formula 
for the remainder, as in the single-variable theorem. 


proof of theorem 3 Let g(t) = f(x + th) with x and h fixed, which is a C3 
function of t. Now apply the single-variable Taylor theorem (1) to g, with k = 2, to 
obtain 


g”(0) 


7i + Ro, 


g(1) = g(0) + 9/(0) + 


where 


Of _ 
1 = — ' Hh = th)hih;, 
g'(t) c orth: g(t) Do mag T Jhih; 
and 
"=> AE E 
' T 2a KOK) Om iNjNk. 


Writing R2 = R2(%, h), we have thus proved: 


af Le af 
f(% +h) = f(%) + 5 hi 5 00) +5 5 a ERa h), 
i l i,j=l l 


Ral n= 5 e Di hiheti 
a ~ 2 mS ee 
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The integrand is a continuous function of t and is therefore bounded by a positive 
constant C on a small neighborhood of X (because it has to be close to its value at x). 
Also note that |h;| < ||hij, for ||Ih|| small, and so 


|Ro(%, h)| < || HIPC. (4) 
In particular, 


|R2(%, h)| 
||? 


as required by the theorem. 

The proof of Theorem 2 follows analogously from the Taylor formula (1) withk = 1. 
A similar argument for Ri shows that |Ri(%, h)|/||Ih|| + 0 as h > Q although this 
also follows directly from the definition of differentiability. m 


<||hiC +0 a h-O 


Forms of the Remainder In Theorem 2, 


1 oi 
Ri(%, w= > fo aay 0 + eR aE = DO zgage hh 
(5) 
where G; lies somewhere on the line joining x to X% +h 
In Theorem 3, 
(t—1)? əf 
/h) = th)hjhjhy dt 
R2(%, h) = a a K Hhhh d 
1 ai 

= `. hh; hg, 1 
2 3! OX) OX) ax, (Sit) Je (5! 


where Gjk lies somewhere on the line joining X to x + h 


The formulas involving qj and Gję (called Lagrange’s form of the remainder) are 
obtained by making use of the second mean-value theorem for integrals. This states that 


f roat t) dt = h(c o fatty 


provided h and g are continuous and g > 0 on [a, b]; here c is some number between 
a and b.? This is applied in formula (4) for the explicit form of the remainder with 
h(t) = (ə?f /ax| Ax; )(% + th) and g(t) =1-t. 


3Proof If g = 0, the result is clear, so we can suppose g # 0; thus, we can assume f’ g(t) dt > 0. 
Let M and m be the maximum and minimum values of h, achieved atty and tm, respectively. io 


g(t) = 0, , : ' 
nf TEEI hotats f g(t) dt 


Thus, ( f? h(t)g(t) dt) /( f? g(t) dt) lies between m = h(tm) and M = h(ty) and therefore, by the 
intermediate-value theorem, equals h(c) for some intermediatec. E 


3.2 Taylor's Theorem 


The third-order Taylor formula is 


n 

f(x +h) = F(%) +Y hi — 0%) + 
i=] j=l 

: > hihih aa (X) + R3(%, h) 

3 2 irj ag t 31X0, 1 
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where R3(%, hh) /||hl|? — 0 as h —> O and so on. The general formula can be proved 


by induction, using the method of proof already given. 


example | Compute the second-order Taylor formula for thefunction f(x, y) = sin(x+2y), about 


the point x = (0, 0). 


solution Notice that 


f (0, 0) = 0, 
of af 
ay O 0) = cos(O +.2-0) = 1, ay ee aD} 2, 
a°f 3f 3?f 
za 0) = 0, ayz 0) = 0, X ayo 0) =0. 
Thus, 
f (h) = f (hi, h2) = hy + 2h2 + R2(0 h), 
where 
ae >0 a h>0 4 


example 2 | Computethesecond-order Taylorformulafor f(x, y) = e* cosy aboutthe point xo = 0, 


Yo = 0. 


solution Here 


#(0,0)=1, (oo =1, (0,0) =0, 
Ox 
a g=, (0 ay Sad, 2 0, 0) =0, 
and so 
f(h) = f (ha, h2) =1 +h: + $h7 — 4nd + R2(0 h), 
where 


Rita >0 a h-O 
|| h/? A 
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example 3 


solution 


example 4 


In the case of functions of one variable, we can expand f(x) in an infinite power 
series, called the Taylor series: 


f” h2 f (k) hk 
(Xo) ecg ce (Xo) 


f(xo +h) = f (Xo) + F'(xo)h + —; ag 


provided we can show that Rx(Xo, h) —> Oask — œ. Similarly, for functions of several 
variables, the preceding terms are replaced by the corresponding ones involving partial 
derivatives, as we have seen in Theorem 3. Again, we can represent such a function by 
its Taylor series provided we can show that R, — 0 ask — oo. This point is examined 
further in Exercise 13. 

The first-, second-, and third-order Taylor polynomials are also called the first-, 
second-, and third-order Taylor approximations to f, since it is presumed that the 
remainder is small and gets smaller as the order of the Taylor polynomial increases. 


Find the first- and second-order Taylor approximations to f(x, y) = sin(xy) atthe point 
(Xo, Yo) = (1, 1/2). 


Here 
f (Xo, Yo) = SIN(XoYo) = sin (7/2) =1 
fx (Xo, Yo) = Yo COS(XoYo) = F cos (7/2) = 0 
fy(Xo, Yo) = Xo COS(XoYo) = cos (7/2) = 0 
7 m? i m? 
fxx(Xo, Yo) = —YG SIN(XoYo) = -7 sin (x/2) = a 
fxy(Xo, Yo) = COS(XoYo) — XoYo SIN(XoYo) = = sin (7/2) = =5 
fyy(Xo, Yo) = —xő sin (Xoyo) = —sin (7/2) = —1. 


Thus, the linear (first-order) approximation is 


I(x, y) = f (Xo, Yo) + fx(Xo, Yo)(X — Xo) + fy(Xo, Yo)( y — Yo) 
=14+0+0=1, 


and the second-order (or quadratic) approximation is 


I m? > 7 A 
gy) =1+0+0+ 5-7) -0+ ( 5 ) v(y >) 


1 m\? 
alr zD) (y = 5) 


See Figure 3.2.1. A 


Find linear and quadratic approximations to the expression (3.98 — 1)?/(5.97 — 3)’. 
Compare with the exact value. 
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figure 3.2.1 The linear and 
quadratic approximations to 
Z= sin (xy) near (1, 2/2). 


solution Let f(x, y) = (x —1)?/(y — 3)?. The desired expression is close to f (4,6) = 1. To 
find the approximations, we differentiate: 


. 2(x — 1) _ -2(x — 1)? 
"(y= 3)?’ (y= 333 
= —4(x = 1) 2 6(x — 1)? 


fxy — fyx = (y—3)3' 
At the point of approximation, we have 


2 2 2 2 
46)= 5, h=- fy=sfn=-py asy fw =F. 


The linear approximation is then 
2 
-£ 


1+ Z (—0.02) 3 —0.03) = 1.00666. 


The quadratic approximation is 


2 2 2 (—0.02)? 4 2 (—0.03)? 
1+ =(—0.02) — =(—0.03 0.02)(—0.03) + = 
+ 3I ) 3! +5 5 z! )( )+3 5 
= 1.00674. 
The “exact” value using a calculator is 1.00675. A 
exercises 
L Let f(x,z) =e*tY, (a) Find the first-order Taylor approximation for L. 
(a) Find the first-order Taylor formula for f at (0, 0). (b) Ae second-order Taylor approximation 
(b) Find the second-order Taylor formula for f at (0, 0). tes Satie oo. 
(c) What will higher-order approximations look 
like? 


2 Suppose L: IR? — R is linear, so that L has the form 
L(x, y) =ax + by. 
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In each of Exercises 3 to 8, determine the second-order Taylor formula for the given function about the given point (Xo, Yo). 


f(x, y) = (x + y)?, where xo = 0, yo = 0 

f(x, y) = 1/(x2 + y? + 1), where xo = 0, yo = 0 
f(x, y) = e**Y, where xo = 0, yo = 0 

f(x, y) = e=’? cos (xy), where xo = 0, yo = 0 
f(x, y) = sin (xy) + cos(xy), where xo = 0, yo = 0 
f(x, y) = el*-))” cosy, where xo = 1, yo = 0 


Calculate the second-order Taylor approximation to 
f(x, y) = cosx sin y at the point (x, 2/2). 


Let f(x, y) = x cos(zy) — y sin(zx). Find the 
second-order Taylor approximation for f at the point 
(1, 2). 


Let g(x, y) = sin(xy) — 3x? log y + 1. Find the 


f(- 


1, —1). Compare your approximation to the exact 


value using a calculator. 


13. (Challenging) A function f: R > Ris called an 
analytic function provided 


f(x +h) = f(x) + Oh 


fk) (x) 


k 
g t+ 


[i.e., the series on the right-hand side converges and 
equals f(x +h)]. 


(a) 


Suppose f satisfies the following condition: On any 
closed interval [a, b], there is a constant M such 
that for all k = 1, 2, 3,..., |f(x)| < M for all 
x € [a, b]. Prove that f is analytic. 


eT l/x x>0 
Let f(x) = { 
0 x <0. 
Show that f isaC °° function, but f is not analytic. 


degree 2 polynomial which best approximates g near the 


point (2/2, 1). 


(c) Give a definition of analytic functions from R” to 
R. Generalize the proof of part (a) to this class of 


functions. 
12. For each of the functions in Exercises 3 to 7, use the (d) Develop f(x, y) = e*+Y in a power series about 
second-order Taylor formula to approximate xo = 0, yo = 0. 


3.3 Extrema of Real-Valued Functions 


Historical Note 


As we saw in the book’s Historical Introduction, the early Greeks sought to 
mathematize nature and to find, as in the geometric Ptolemaic model of 
planetary motion, mathematical laws governing the universe. With the revival of 
Greek learning during the Renaissance, this point of view again took hold and 
the search for these laws recommenced. In particular, the question was raised as 
to whether there was one law, one mathematical principle that governed and 
superseded all others, a principle that the Creator used in His Grand Design of 
the Universe. 


MAUPERTUIS’ PRINCIPLE. In 1744, the French scientist Pierre-Louis de Maupertuis 
(see Figure 3.3.1) put forth his grand scheme of the world. The “metaphysical 
principle” of Maupertuis is the assumption that nature always operates with the 
greatest possible economy. In short, physical laws are a consequence of a 
principle of “economy of means”; nature always acts in such a way as to 
minimize some quantity that Maupertuis called the action. Action was nothing 
more than the expenditure of energy over time, or energy x time. In 
applications, the type of energy changes with each case. For example, physical 
systems often try to “rearrange themselves” to have a minimum energy—such as 
a ball rolling from a mountain peak to a valley, or the primordial irregular earth 
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figure 3.3.1 Pierre-Louis de Maupertuis (1698-1759). 


assuming a more nearly spherical shape. As another example, the spherical 
shape of soap bubbles is connected with the fact that soheres are the surfaces of 
least area containing a fixed volume. 

We state Maupertuis’ principle formally as: Nature always minimizes action. 
Maupertuis saw in this principle an expression of the wisdom of the Supreme 
Being, of God, according to which everything in nature is performed in the most 
economical way. He wrote: 


What satisfaction for the human spirit that, in contemplating these laws 
which contain the principle of motion and of rest for all bodies in the 
universe, he finds the proof of existence of Him who governs the world. 


Maupertuis indeed believed that he had discovered God’s fundamental law, the 
very secret of Creation itself, but he was actually not the first person to pose this 
principle. 

In 1707, Leibniz wrote down the principle of least action in a letter to Johann 
Bernoulli, which became lost until 1913, when it was discovered in Germany’s 
Gotha library. For Leibniz, this principle was a natural outgrowth of his great 
philosophical treatise The Theodicy, in which he argues that God may indeed 
think of all possible worlds, but would want to create only the best among them; 
and hence our world is necessarily the best of all possible worlds. 

Action, as defined by Leibniz, was motivated by the following reasoning, used 
in his letter. Think of a hiker walking along a road, and consider how to describe 
his action. If he travels 2 kilometers in 1 hour, you would say that he has carried 
out twice as much action as he would if he traveled 2 kilometers in 2 hours. 
However, you would also say that he carries out twice as much action in traveling 
2 kilometers in 2 hours as he would in traveling 1 kilometer in 1 hour. Altogether 
then, our hiker, by walking 2 kilometers in 1 hour, carries out 4 times as much 
action as he would in traveling 1 kilometer in 1 hour. 
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Using this intuitive idea, Maupertuis defined action as the product of distance, 
velocity, and mass: 


Action = Mass x Distance x Velocity. 


Mass is included in this definition to account for the hiker’s backpack. Moreover, 
according to Leibniz, the kinetic energy E is given by the formula: 


E= y x Mass x (Velocity)’. 


So action has the same physical dimension as 
Energy x Time, 


because velocity is distance divided by time. 

In the 250 years after Maupertuis formulated his principle, this principle of least 
action has been found to be a “theoretical basis” for Newton’s law of gravity, 
Maxwell's equations for electromagnetism, Schrodinger’s equation of quantum 
mechanics, and Einstein’s field equation in general relativity. 

There is much more to the story of the least-action principle, which we will 
revisit in Section 4.1 and in the Internet supplement. 


Maxima and Minima for Functions of n-Variables 


As the previous remarks show, for Leibniz, Euler, and M aupertuis, and for much of 
modern science as well, all in nature is a consequence of some maximum or minimum 
‘principle. To make such grand schemes— as well as some that are more down to earth— 

effective, we must first learn the techniques of how to find maxima and minima of 
functions of n variables. 


Extreme Points 


Among the most basic geometric features of the graph of a function are its extreme 
points, at which the function attains its greatest and least values. In this section, we 
derive a method for determining these points. In fact, the method locates local extrema 
as well. These are points at which the function attains a maximum or minimum value 
relative only to nearby points. Let us begin by defining our terms. 


Definition If f: U c R” > Risa given scalar function, a point X e U is 
called a local minimum of f if there is a neighborhood V of X such that for all 
points xin V, f(x) > f(%). (See Figure 3.3.2.) Similarly, x € U is a local 
maximum if there is a neighborhood V of X such that f (x) < f(x) forallxe V. 
The point x € U is said to be alocal, or relative, extremum if it is either a local 
minimum or a local maximum. A point X is a critical point of f if either f is 
not differentiable at %, or if itis, Df (X) = Q A critical point that is not a local 
extremum is called a saddle point 4 


‘The term “saddle point” is sometimes not used this generally; we shall discuss saddle points further 
in the subsequent development. 


figure 3.3.2 (a) Local minimum 
and (b) local maximum points 
for a function of two variables. 
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Graph of t Graph of t 
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p >y eT | >y 


X è X 
Xo Žo 
Local minimum Local maximum 
(a) (b) 


First-Derivative Test for Local Extrema 


The location of extrema is based on the following fact, which should be familiar from 
one-variable calculus (the casen = 1) 


Theorem 4 First-Derivative Test for Local Extrema If U c R" is open, 
the function f: U c R” —> Ris differentiable, and X € U isa local extremum, 
then Df (%)) = Q that is, % is a critical point of f. 


proof Suppose that f achieves a local maximum at Xo. Then for any h e R°, 
the function g(t) = f(x) + th) has a local maximum at t = 0. Thus, from one- 
variable calculus g’(0) = 0.° On the other hand, by the chain rule, 


g'(0) = [Df (%)]h. 


Thus, [Df (x) ]h = 0 for every h, and so Df (x)) = O The case in which f achieves a 
local minimum at % is entirely analogous. E 


If we remember that Df (X) = Omeans that all the components of Df (x) are zero, 
we can rephrase the result of Theorem 4: If X is a local extremum, then 


of ; 

TA = 9 i=1,....0 
that is, each partial derivative is zero at xX. In other words, Vf (x) = O where Vf is 
the gradient of f. 

If weseek to find the extrema or local extrema of a function, then Theorem 4 states that 
we should look among the critical points. Sometimes these can be tested by inspection, 
but usually we use tests (to be developed below) analogous to the second-derivative test 
in one-variable calculus. 


>Recall the proof from one-variable calculus: Because g(0) is alocal maximum, g(t) < g(0) for small 
t > 0, so g(t) — g(0) < 0, and hence g’(0) = limit,_.9+ (g(t) — g(0))/t < 0, where limit: „o+ means 
the limit as t + 0, t > 0. For small t < 0, we similarly have g’(0) = limito- (g(t) — g(0))/t > 0. 
Therefore, g'(0) = 0. 
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example | 


solution 


example 2 


solution 


example 3 


solution 


Find the maxima and minima of thefunction f : R? —> R, defined by f(x, y) = X? + y?. 
(Ignore the fact that this example can be done by inspection.) 


We first identify the critical points of f by solving the two equations af /ax = 0 and 
af /ay = 0, for x and y. But 


af of 
a= 2x and ay > 2y, 


so the only critical point is the origin (0, 0), where the value of the function is zero. 
Because f(x, y) > 0, this point is a relative minimum— in fact, an absolute, or global, 
minimum— of f. Because (0, 0) is the only critical point, there are no maxima. A 


Consider the function f: R? — R, (x, y) => x? — y?. Ignoring for the moment that this 
function has a saddle and no extrema, apply the method of Theorem 4 for the location 
of extrema. 


Asin Example 1, we find that f has only one critical point, at the origin, and the value 
of f there is zero. Examining values of f directly for points near the origin, we see 
that f(x,0) > f(0, 0) and f(0, y) < f(0, 0), with strict inequalities when x 4 0 and 
y + 0. Because x or y can be taken arbitrarily small, the origin cannot be either a relative 
minimum or a relative maximum (so it is a saddle point). Therefore, this function can 
have no relative extrema (see Figure 3.3.3). 


Graph of t 


figure 3.3.3 A function of two variables with 
a saddle point. A 


Find all the critical points of z = x*y + y?x. 


Differentiating, we obtain 


Oz ðZ 
—=2 —=2 x. 
Ox ee oy xy + 


Equating the partial derivatives to zero yields 


2xy +y?=0, 2xy+x?=0. 


example 4 


solution 
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Subtracting, we obtain x? = y?. Thus, x = y. Substituting x = +y in the first of the 
two preceding equations, we find that 


2y? i y? = 3y? = 0, 
so that y = 0 and thus x = 0. If x = —y, then 


—2y? + y? - -y? =0, 
so y = 0 and therefore x = 0. Hence, the only critical point is (0, 0). For x = y, 
z = 2x3, which is both positive and negative for x near zero. Thus, (0, 0) is not a 
relative extremum. A 


2 


Refer to Figure 3.3.4, a computer-drawn graph of the function z = 2(x? + y?) e=% =$, 
W here are the critical points? 


figure 3.3.4 The volcano: z = 2(x? + y?) exp (— x? — y?), 


Because z = 2(x? + y2)e-*’-”, we have 


OZ 


x= 4x(e™™ =) + 2(x2 + y2)e-*’-Y"( 2x) 
=e-Y"[ax — Ax(x? + y?)] 


= 4x(e*’-Y’)(1 — x? — y?) 
and 
Oz 


E Aye Vl = x? — y?). 


These both vanish when x = y = 0 or when x? + y? = 1. This is consistent with the 
figure: Points on the crater’s rim are maxima and the origin is a minimum. A 


Second-Derivative Test for Local Extrema 


The remainder of this section is devoted to deriving a criterion, depending on the second 
derivative, for a critical point to be a relative extremum. In the special case n = 1, our 
criterion will reduce to the familiar condition from one-variable calculus: f”(xo) > 0 
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fora minimum and f (Xo) < 0 for a maximum. But in the general context, the second 
derivative is a fairly complicated mathematical object. To state our criterion, we will 
introduce a version of the second derivative called the Hessian, which in turn is related 
to quadratic functions. Quadratic functions are functions g: R" —> R that have the 
form 


n 
g(hi, ..., hn) = 5 aijhihj 
i,j=1 
foran n x n matrix [aij]. In terms of matrix multiplication, we can write 
ai a12 +e An hi 


glhir... An) = [hiha] 


anı an2 > Ann hn 


For example, if n = 3, 


1 -1 0] fh: 
g(hy, hz, h3) = h? — 2hıh2 + hå = [hı hzh;3] | -1 0 0| | hy 
0 0 1| lh; 


is a quadratic function. 

Wecan, if wewish, assumethat[a;j] is symmetric; in fact, g is unchanged if we replace 
[ai] by the symmetric matrix [bj], where by = (aj + aji), because hih; = hjhj and 
the sum is over all i and j. The quadratic nature of g is reflected in the identity 


g(Ahy,..., Ahn) = Ag (hi, ..., Nn), 
which follows from the definition. 


Now we are ready to define Hessian functions (named after L udwig Otto Hesse, who 
introduced them in 1844). 


Definition Suppose that f: U c R" > R has second-order continuous deriva- 
tives (0°f/ əxi 9x;)(%), fori, j =1,...,n, ata point X € U. The Hessian of 
f at x is the quadratic function defined by 


n 2 
Hf (a)(I = 5 > ehhh 
RE 


ƏXi OX; 
a°f af 
1 0X1 0X1 0X1 OX hy 
= lhu] | 
a f aei hh 
OXn 0X1 OXn OXn 


Notice that, by equality of mixed partials, the second-derivative matrix is 
symmetric. 
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This function is usually used at critical points % € U . In this case, Df (%)) = O so 
the Taylor formula (see Theorem 2, Section 3.2) may be written in the form 


Thus, at a critical point the Hessian equals the first nonconstant term in the Taylor 
series of f. 

A quadratic function g: R” —> Riscalled positive-definite if g(h) > 0 forall he R" 
and g(h) = 0 only for h = Q Similarly, g is negative-definite if g(h) < 0 and g(h) = 0 
for h= Oonly. Note that if n = 1, Hf (Xo)(h) = 5 f”(xo)h?, which is positive-definite 
if and only if f’”(Xo) > 0. 


Theorem 5 Second-Derivative Test for Local Extrema If f:U c R! > 


Risof classC 7, X €U isacritical point of f, otal 


Actually, we shall prove that the extrema given by this criterion are strict. A relative 
maximum % is said to be strict if f(x) < f(x) for nearby x 4 X. A Strict relative 
minimum is defined similarly. Also, the theorem is valid even if f is only C2, but we 
have assumed C ? for simplicity. 

The proof of Theorem 5 requires Taylor’s theorem and the following result from 
linear algebra. 


Lemma | If B = [bj] is ann x n real matrix, and if the associated quadratic 
function 


n 
H: R" —> R, (hy, ..., hy) [= Z J baht 
i= 


is positive-definite, then there is a constant M > 0 such that for all he R’; 


H (h) > M hy’. 


proof For |h = 1, set g(h) = H (h). Then g is a continuous function of h for 
|| hi) = 1 and so achieves a minimum value, say M .ĉ Because H is quadratic, we have 


H(h) =H (ah) =H (aa) Ih? = (ar) 


for any h # Q (The result is obviously valid if h= 0) m 


Ih? > M Jih]? 


N otethat the quadratic function associated with the symmetric matrix 5(07f /axi xj) 
is exactly the Hessian. 


5H ere we are using, without proof, a theorem analogous to a theorem in calculus that states that every 
continuous function on an interval [a, b] achieves a maximum and a minimum; see Theorem 7. 
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proof of theorem 5 Recall thatif f: U c R” > R is of class C 3 and % € U isa 
critical point, Taylor's theorem may be expressed in the form 


f(x + h) — f(x) = Hf %)(h) + R2(%, h), 


where (R2(%, h))/||h||? — 0 as h> O 
Because Hf (%) is positive-definite, Lemma 1 assures us of a constant M > 0 such 
that for all he R" 


Hf (x) (h) > M |i’. 
Because R>(X, h) /||hij? — 0 as h— Q there is ad > 0 such that for 0 < ||| < 65 
|Ro(%, h)| < M Iihļ?. 
Thus, 0 < Hf (%)(h) + Ro(%, h) = f(x + h) — f (x) for 0 < ihj] < 8, so that X is 
a relative minimum; in fact, a strict relative minimum. 


The proof in the negative-definite case is similar, or else follows by applying the 
preceding to — f , and is left as an exercise. E 


Consider again the function f: R? —> R, (x, y) > x? + y?. Then (0, 0) is a critical 
point, and f is already in the form of Taylor’s theorem: 


f((0, 0) + (hy, h2)) = f (0, 0) + (h? + h3) +0. 
We can see directly that the Hessian at (0, 0) is 
Hf(O(h) =h? + h3, 


which is clearly positive-definite. Thus, (0, 0) is a relative minimum. This simple case 
can, of course, be done without calculus. Indeed, itis clear that f(x, y) > 0 for all 
(x,y) # (0,0). 4 


For functions of two variables f(x, y), the Hessian may be written as follows: 


ai a’f | 
əx? ayax | [hy 
=l 
troen = ameta mr att | | 
əxəy ay? 


Now we shall give a useful criterion for when a quadratic function defined by such a 
2 x 2 matrix is positive-definite. This will be useful in conjunction with Theorem 5. 


Lemma 2 Let 


h 
ras d and H (h) = 3{hi, h2]B fal 


Then H (h) is positive-definite if and only if a > 0 and det B = ac — b? > 0. 
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proof Wehave 


Hah? + 2bhyh +ch3). 


L et us complete the square, writing 


1 bo? 1 bN a 


Suppose H is positive-definite. Setting hə = 0, weseethata > 0. Setting hı =—(b/a)hy, 
we get c — b? /a > 0 or ac — b? > 0. Conversely, if a > 0 and c — b? /a > 0, H (h) isa 
sum of squares, so that H (h) > 0. If H (h) = 0, then each square must be zero. This 
implies that both hı and hz must be zero, so that H (h) is positive-definite. m 


Similarly, we can see that H (h) is negative-definite if and only if a <0 and 
ac — b? > 0. We note that an alternative formulation is that H (h) is positive- (respec- 
tively, negative-) definite if a + c = trace B > 0 (respectively, < 0) and det B > 0. 


Determinant Test for Positive Definiteness 


There are similar criteria to test the positive (or negative) definiteness of an n x n 
symmetric matrix B, thus providing a maxima and minima test for functions of n- 
variables. Consider the n square submatrices along the diagonal (see Figure 3.3.5). B is 
positive-definite (that is, the quadratic function associated with B is positive-definite) if 
and only if the determinants of these diagonal submatrices are all greater than zero. For 
negative-definite B , the signs should be alternately <0 and > 0. We shall not prove this 
general case here.’ In case the determinants of the diagonal submatrices are all nonzero, 
but the H essian matrix is not positive- or negative-definite, the critical point is of saddle 
type; in this case, we can show that the point is neither a maximum nor a minimum in 
the manner of Example 2. 


[| 


figure 3.3.5 “Diagonal” submatrices are used in the criterion 
for positive definiteness; they must all have determinant > 0. 


'This is proved in, for example, K. Hoffman and R. K unze, Linear Algebra, Prentice Hall, Englewood 
Cliffs, N.J., 1961, pp. 249-251. For students with sufficient background in linear algebra, it should be 
noted that B is positive-definite when all of its eigenvalues (which are necessarily real, because B is 
symmetric) are positive. 
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solution 


General Second-Derivative Tests (n-variables) 


Suppose X% € R” is a critical point for a C? function f : U — R, U an open set 
containing X; that is, H X) = 0, i = 1,---,n. Suppose that the Hessian matrix 


(Fe (0 )} is positive-definite; then x is a strict local minimum for f. If the Hessian 
matrix is negative-definite, X is a strict local maximum. If the Hessian matrix is neither 
positive- nor negative-definite, but its determinant is nonzero, it is of saddle type (it is 
neither a maximum nor a minimum). If the determinant of the Hessian is zero, itis said 
to be of degenerate type and nothing can be said about the nature of the critical point 
without further analysis. Figure 3.3.5 illustrates a simple test for the positive definiteness 
of a symmetric matrix. In the case of two variables, the maximum and minimum test 
can be considerably simplified. 


Second-Derivative Test (two variables) 


Lemma 2 and Theorem 5 imply the following result: 


Theorem 6 Second-Derivative Maximum-Minimum Test for Functions 
of Two Variables Let f(x, y) be of class C? on an open setU in R2. A point 

(Xo, Yo) is a (strict) local minimum of f provided the following three conditions 

hold: 


te 


ay (Xo, Yo) = 0 


i? T Yo) > 0 


m af \ (af aft \? 
(iii) >= (2) (2) (say) > 0 at (Xo, Yo) 


(D is called the discriminant of the Hessian.) If in (ii) wehave <0 instead of 
>0 and condition (iii) is unchanged, then we have a (strict) local maximum. 


ID < 0 (eg, if È FF oo) = Or a 0, but 2%, yo) 


ay? OXOY 
0), then (Xo, Yo) is . saddle type (neither a maximum nor a minimum). 


Classify the critical points of the function f: R? — R, defined by (x, y) => x? — 
2xy + 2y?. 


Asin Example 5, we find that f(0, 0) = 0, the origin is the only critical point, and the 
Hessian is 
Hf(O(h) = hf — 2hıhz + 2h3 = (hy — h2)? +h}, 


whichis clearly positive-definite. Thus, f hasarelative minimum at (0, 0). Alternatively, 
we can apply Theorem 6. At (0, 0), 3?f /3x? = 2, a*f /ay* = 4, and a°f /ax ay = —2. 
Conditions (i), (ii), and (iii) hold, so f has a relative minimum at (0, 0). A 


example 7 


solution 
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If D < 0 in Theorem 6, then we have a saddle point. In fact, we can prove that 
f(x, y) is larger than f (Xo, Yo) as we move away from (Xo, Yo) in some direction and 
smaller in the orthogonal direction (see Exercise 32). The general appearance is thus 
similar to that shown in Figure 3.3.3. The appearance of the graph near (Xo, Yo) in the 
case D = 0 must be determined by further analysis. 

We summarize the procedure for dealing with functions of two variables: A fter all 
critical points have been found and their associated Hessians computed, some of these 
Hessians may be positive-definite, indicating relative minima; some may be negative- 
definite, indicating relative maxima; and some may be neither positive- nor negative- 
definite, indicating saddle points. The shape of the graph at a saddle point where D < 0 
is like that in Figure 3.3.3. Critical points for which D + 0 are called nondegenerate 
critical points. Such points are maxima, minima, or saddle points. The remaining critical 
points, where D = 0, may be tested directly, with level sets and sections or by some 
other method. Such critical points are said to be degenerate; the methods developed in 
this chapter fail to provide a picture of the behavior of a function near such points, so 
we examine them case by case. 


Locate the relative maxima, minima, and saddle points of the function 
f(x, y) = log (x? + y? +1). 


Wemustfirstlocate the critical points of this function; therefore, according to Theorem 3, 
we calculate 
2x . 2y x 
= i+ j. 
x? +y?+1 x+y? +1 


Vf (x, y) 


Thus, Vf (x, y) = Oif and only if (x, y) = (0, 0), and so the only critical point of f 
is (0, 0). Now we must determine whether this is a maximum, a minimum, or a saddle 
point. The second partial derivatives are 

OF  2(x? +y? +1) — (2x)(2x) 


Ox? (x? + y? +1) 
af  2(x? +y? +1) — (2y)(2y) 
əy? (x? + y? + 1)? ' 
and 
vf = —2x(2y) 
axay (x? +y? +1) 
Therefore, 
əf 3?f 3?f 
za 0) =2= TA 0) and ax ayo 0) = VU, 
which yields 
D=2-2=4>0. 


Because (df /3x?)(0, 0) > 0, weconclude by Theorem 6 that (0, 0) isalocal minimum. 
(Can you show this just from the fact that log t is an increasing function oft > 0?) A 
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example 8 


solution 


The graph of the function g(x, y) = 1/xy is a surface S in R3. Find the points on S that 
are closest to the origin (0, 0, 0). (See Figure 3.3.6.) 


d 
a 
GB 


>> 
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ZZ» 
ZEEE 


f 


ZZ 


a 
NS 


figure 3.3.6 The surface z = 1/xy defined over the first 
quadrant in the xy plane. (There are similar figures in the 
other quadrants, but notice that z < 0 in the second and 
fourth quadrants.) 


Each point on S is of the form (x, y, 1/xy). The distance from this point to the origin is 


1 
d(x, y) = RE 


Itis easier to work with the square of d, so let f (x, y) = x? +y? +(1/x?y?), which will 
have the same minimum point. This follows from the fact that d(x, y)? > d (Xo, Yo)? if 
and only if d(x, y) > d(Xo, Yo). Notice that f (x, y) becomes very large as x and y get 
larger and larger; f(x, y) also becomes very large as (x, y) approaches the x or y axis 
where f is not defined, so f must attain a minimum at some critical point. The critical 
points are determined by: 


ax x3y2 0, 
of 2 
ay y3x2 
that is, x*y? — 1 = 0, and x?y* — 1 = 0. From the first equation we get y? = 1/x4, and, 
substituting this into the second equation, we obtain 
x 1 
eX 
Thus, x = +1 and y = +1, and it therefore follows that f has four critical points, 
namely, (1, 1), (1, —1), (—1, 1), and (—1, —1). Note that f has the value 3 for all these 


points, so they are all minima. Therefore, the points on the surface closest to the point (0, 
0, 0) are (1, 1, 1), (1, —1, —1), (—1, 1, —1), and (—1, —1, 1) and the minimum distance 
is /3. Is this consistent with the graph in Figure 3.3.6? A 


example 9 


solution 


example 10 


solution 
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Analyze the behavior of z = x°y + xy? + xy at its critical points. 


The first partial derivatives are 


OZ 
ety ty ay tye) 


and 


OZ 
— = x(5yf + xf + 1). 
ay (5y* +x" +1) 
The terms 5x* + y + 1 and 5y* + x4 + 1 are always greater than or equal to 1, and so 
it follows that the only critical point is (0, 0). 

The second partial derivatives are 


az 3 az 
ae 20x?y, T 20xy 
and 
: 4 4 
= 1. 
TE 5x* + 5y" + 


Thus, at(0, 0), D = —1, and so (0, 0) is anondegenerate saddle point and the graph of 
z near (0, 0) looks like the graph in Figure 3.3.3. A 


We now look at an example for a function of three variables. 


Consider f(x, y) =x? + y? +z? + 2xyz. Show that (0, 0, 0) and (—1, 1, 1) are both 
critical points. Determine whether they are local minima, local maxima, saddle points, 
or none of them. 

ar = 2x + 2yz, © = 2y + 2xz, and 4 = 2z + 2xy, all of which vanish at (0, 0, 0) 
and (—1, 1, 1). Thus, these are critical points. The Hessian of f at (0,0, 0) is 


2 0 0 

0 2 0]. 

0 0 2 

2 0 
0 2 
positive determinants. Therefore (c.f. Figure 3.3.5) (0, 0, 0) is a strict local minimum. 
On the other hand, the Hessian matrix of f at(—1, 1, 1) is 


The diagonal submatrices are [2] and and the Hessian itself, all of which have 


af 3?f ə?f 


əx? AXdY əxəðzZ 
ə? f af əf 
ayox əy?  ayaz 


ai 3?f əf 
əzəx dzdy az? 
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or 


2 2 2 
2 2 -2]|. 
2-2 2 


The determinant of the first diagonal matrix is 2, the second diagonal matrix 


2 2 

2 2 
is zero, and the determinant of the Hessian is —16. Thus, the critical point (—1, 1, 1) is 
of saddle type (i.e, neither a maximum nor a minimum). A 


Global Maxima and Minima 


We end this section with a discussion of the theory of absolute, or global, maxima 
and minima of functions of several variables. Unfortunately, the location of absolute 
maxima and minima for functions on R” is, in general, a more difficult problem than 
for functions of one variable. 


Definition Suppose f: A — Risa function defined on a set A in R? or R?. A 
point X € A is said to be an absolute maximum (or absolute minimum) point of 
f if f(x) < f(x) [or f(x) > f(x)] forall xe A. 


In one-variable calculus, we learn— but often do not prove— that every continu- 
ous function on a closed interval | assumes its absolute maximum (or minimum) 
value at some point X in |. A generalization of this theoretical fact also holds in R". 
Such theorems guarantee that the maxima or minima one is seeking actually exist; 
therefore, the search for them is not in vain. 


Definition A set D € R” is said to be bounded if thereisa number M > Osuch 
that |x| < M for all xe D.A setis closed if it contains all its boundary points. 

As an important example, we note that the level sets {(X1, X2,---, Xn) If (X1, 
X2,..-,Xn) = C} of a continuous function f are always closed. 


Thus, asetis bounded if itcan be strictly contained in some (large) ball. The appropri- 
ate generalization of the one-variable theorem on maxima and minima is the following 
result, stated without proof. 


Theorem 7 Global Existence Theorem for Maxima and Minima Let 
D beclosed and bounded in R” and let f: D —> Rbecontinuous. Then f assumes 
its absolute maximum and minimum values at some points X and x of D. 
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D =U UaU 


figure 3.3.7 D = U u 3U: Two 

examples of regions whose au 
boundary is a piecewise smooth 

curve. 


Simply stated, X and X are points where f assumes its largest and smallest values. 
Asin one-variable calculus, these points need not be uniquely determined. 

Suppose now that D = U U dU, where U is open and aU is its boundary. If 
D c R?, we suppose that ðU is a piecewise smooth curve; that is, D is a region 
bounded by a collection of smooth curves— for example, a square or the sets depicted in 
Figure 3.3.7. 

If X and x are in U , we know from Theorem 4 that they are critical points of f. 
If they are in 3U , and aU is a smooth curve (i.e., the image of a smooth path c with 
c + 0), then they are maximum or minimum points of f viewed as a function on aU . 
These observations provide a method of finding the absolute maximum and minimum 
values of f onaregionD. 


Strategy for Finding the Absolute Maxima and Minima on a 
Region with Boundary Let f be a continuous function of two variables 
defined on a closed and bounded region D in R?, which is bounded by a smooth 
closed curve. To find the absolute maximum and minimum of f on D: 


(i) Locate all critical points for f inU. 

(ii) Find all the critical points of f viewed as a function only on aU . 
(iii) Compute the value of f at all of these critical points. 
(iv) Compare all these values and select the largest and the smallest. 


If D is aregion bounded by a collection of smooth curves (such as a square), then we 
follow a similar procedure, but including in step (iii) the points where the curves meet 
(such as the corners of the square). 

All the steps except step (ii) should now be familiar to you. To carry out step (ii) 
in the plane, one way is to find a smooth parametrization of aU ; that is, we find a 
path c: | — 3U, where | is some interval, which is onto aU . Second, we consider 
the function of one variable t > f(dt)), wheret €e I, and locate the maximum and 
minimum points to, tı € | (remember to check the endpoints!). Then dto), dt:) will 
be maximum and minimum points for f as a function on aU. Another method for 
dealing with step (ii) is the Lagrange multiplier method, to be presented in the next 
section. 
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example 1] Find the maximum and minimum values of the function f(x, y) = x?+y?—x-—y+l1 
in the disc D defined by x? + y? < 1. 


solution (i) To find the critical points we set of /ax = af/ay = 0. Thus, 2x — 1 = 0, 
2y — 1 = 0, and hence (x, y) = (4, 5) is the only critical point in the open disc 
U = {(x, y) |x? +y? < 1}. 


(ii) The boundary dU can be parametrized by qt) = (sint, cost), 0 < t < 2x. Thus, 
f(dt)) = sin? t + cos? t — sint — cost + 1 = 2 — sint — cost = g(t). 


To find the maximum and minimum of f on dU , it suffices to locate the maxi- 
mum and minimum of g. Now g'(t) = 0 only when 


m 5r 


sint = cost, that is, when t=—, —. 
4 4 


Thus, the candidates for the maximum and minimum for f on aU are the points 
7/4), (5/4), and the endpoints d0) = d2z). 


(iii) The values of f at the critical points are: f(5, 5) = 5 from step (i) and, from 


step (ii), 
(43))=(8 9) 
ees eee eee 
and 


f(d0)) = f(e27)) = f(0, 1) =1. 


(iv) Comparing all thevalues 5, 2—./2, 2+/2, 1, itis clear that the absolute minimum 
is 4 and the absolute maximum is 2 + J/2. A 


In Section 3.4, we shall consider a generalization of the strategy for finding the 
absolute maximum and minimum to regions D in R". 


exercises 


In Exercises 1 to 16, find the critical points of the given function and then determine whether they are local maxima, local 
minima, or saddle points. 


L f(x,y) =x2-y2+xy 6. f(x, y) =x? — 3xy + 5x — 2y + 6y? + 8 

2 f(x,y) =x2 4 y? —xy 7. f(x, y) = 3x2 + 2xy+2x+y?+y+4 

3. f(x, y) =x? +y? + 2xy 8. f(x, y) = sin(x? + y?) [consider only the critical point 
(0, 0)] 


4 f(x,y) =x? +y? + 3xy 
9. f(x, y) = cos(x? + y?) [consider only the three critical 
5. f(x, y) =elt-y’ points (0, 0), (./7/2, /x/2), and (0, /7)] 


f(x,y) =y4+xsiny 

f(x, y) =e* cosy 

f(x, y) = (x — y)(xy — 1) 
el 

eE 

f(x, y) = log (2 + sin xy) 


f(x, y) =xsiny 


Shs REB 


f(x, y) =(x + y)(xy + J) 
17. Find all local extrema for f(x, y) = 8y? + 12x? — 24xy. 


18. Let f(x, y, Z) = X? + y? +z? + kyz. 


(a) Verify that (0, 0, 0) is a critical point for f. 


(b) Find all values of k such that f has a local 
minimum at (0, 0, 0). 


19. Find and classify all critical points of 
f(x, y) = 3x3 + fy? — $x? — 3y? + 6y + 10. 


20. Suppose (4, 2) is acritical point for the C 2 function 
f(x, y). In each case, determine whether (4, 2) is a local 
maximum, a local minimum, or a saddle point. 


(a) fyx (4, 2) = 1, fxy(4, 2) = 3; fyy — 5 
(b) fxx(4,2)=2, fyx(4,2)=—1, fy =4 
(c) fxx(4, 2) = —2, fxy(4, 2) = 1, fyy = 3 


2L Find the local maxima and minima for 
Zz = (x2 + 3y?) el-*’-Y", (See Figure 2.1.15.) 


22. Let f(x, y) =x? + y? + kxy. If you imagine the graph 
changing as k increases, at what values of k does the 
shape of the graph change qualitatively? 


23. An examination of the function 
f: R? > R, (x, y) = (y — 3x2)(y — x2) will give an 
idea of the difficulty of finding conditions that guarantee 
that a critical point is a relative extremum when 
Theorem 6 fails. Show that 


(a) the origin is a critical point of f; 


(b) f has a relative minimum at (0, 0) on every straight 
line through (0, 0); that is, if g(t) = (at, bt), then 
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f og: R > R has a relative minimum at 0, for 
every choice of a and b; 


(c) the origin is not a relative minimum of f. 


24. Let f(x, y) = Ax? + E, where A and E are constants. 
What are the critical points of f? Are they local maxima 
or local minima? 


25. Let f(x, y) =x? —2xy + y?. Here D = 0. Can you say 
whether the critical points are local minima, local 
maxima, or saddle points? 


26. Let f(x, y) = ax? + bx?, wherea, b # 0. 


(a) Show that (0, 0) is the only critical point for f. 


(b) Determine the nature of this critical point in terms 
of a and b. 


27. Suppose f: R? + R is C?, and that » is a critical point 
for f. Suppose Hf (xo)(h) = h? +h3 +h? + 4h2h3. 
Does f have a local maximum, minimum, or saddle at 
x? 


28. Find the point on the plane 2x — y + 2z = 20 nearest the 
origin. 


29. Show that a rectangular box of given volume has 
minimum surface area when the box is a cube. 


30. Show that the rectangular parallelepiped with fixed 
surface area and maximum volume is a cube. 


3L Write the number 120 as a sum of three numbers so that 
the sum of the products taken two at a time is a 
maximum. 


32. Show that if (xo, yo) is a critical point of a quadratic 
function f(x, y) and D < 0, then there are points (x, y) 
near (Xo, Yo) at which f(x, y) > f(Xo, yo) and, 
similarly, points for which f(x, y) < f (Xo, Yo). 


33. Let f(x, y) =x®4+x24y® g(x, y) = 
—~x6 — x? — yê, h(x, y) = xf — x4 + yê. 
(a) Show that (0, 0) is a degenerate critical point for 
f,g, and h. 


(b) Show that (0, 0) is a local minimum for f, a local 
maximum for g, and a saddle for h. 


BA. Let f(x, y) =5ye* — e” — y5, 


8T his interesting phenomenon was first pointed out by the famous mathematician Giuseppe Peano (1858- 1932). Another curious “pathology” 


is given in Exercise 41. 
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(a) Show that f has a unique critical point and that this 
pointis alocal maximum for f. 


(b) Show that f is unbounded on the y axis, and thus 
has no global maximum. [Note that for a function 
g(x) of a single variable, a unique critical point 
which is a local extremum is necessarily a global 
extremum. This example shows that this is not the 
case for functions of several variables. ] 


35. Determine the nature of the critical points of the function 
f(x, y, Z) =x? +y? +z? + xy. 


36. Letn bean integer greater than 2 and set f (x, y) = 
ax" + cy", where ac + 0. Determine the nature of the 
critical points of f. 


37. Determine the nature of the critical points of 
f(x, y) = x3 + y? — 6xy + 6x + 3y. 


38. Find the absolute maximum and minimum values of the 
function f(x, y) = (x2 + y?)4 on the disc x? + y? < 1. 
(You do not have to use calculus.) 


39. Repeat Exercise 38 for the function 
f(x, y) =x? + xy + y2. 


In Exercises 46 through 50, D denotes the unit disc. 


46. Letu be aC? function on D which is “strictly 
subharmonic”; that is, the following inequality holds: 
V2u = (d2u/ax2) + (a2u/ay2) > 0. Show that u 
cannot have a maximum point in D \aD (the set of 
points in D, butnotin aD). 


47. Letu bea harmonic function on D — that is, V2u = 0 on 
D \aD — and be continuous on D . Show that if u 
achieves its maximum value in D \aD , it also achieves it 
on aD. This is sometimes called the “weak maximum 
principle” for harmonic functions. [HINT: Consider 
V?(u + £e*), e > 0. You can use the following fact, 
which is proved in more advanced texts: Given a 
sequence {ph}, n =1, 2,..., of points in a closed 
bounded set A in R? or R3, there exists a point q such 
that every neighborhood of q contains at least one 
member of {p }.] 


48. Define the notion of a strict superharmonic function u 
on D by mimicking Exercise 46. Show that u cannot 
havea minimum in D\aD. 


49. Letu be harmonic in D as in Exercise 47. Show that if u 
achieves its minimum value in D \aD, it also achieves it 


40. A curveC in space is defined implicitly on the cylinder 
x2 + y? = 1 by the additional equation 
x? — xy + y? — z? = 1. Find the point or points on C 
closest to the origin. 


AL Find the absolute maximum and minimum values for 
f(x, y) = sinx + cos y on the rectangle R defined by 
0<x<27,0<y<d2z. 


42. Find the absolute maximum and minimum values for the 
function f(x, y) = xy on the rectangle R defined by 
—-1<x<1l1,-l<y<l. 


43. Let f(x, y) =1+xy — 2x + y and let D be the 
triangular region in R? with vertices (—2, 1), (—2, 5), 
and (2, 1). Find the absolute maximum and minimum 
values of f on D. Give all points where these extreme 
values occur. 


44A. Let f(x,y) =14+ xy +x — 2y and let D be the 
triangular region in R? with vertices (1, —2), (5, —2), 
and (1, 2). Find the absolute maximum and minimum 
values of f on D. Giveall points where these extreme 
values occur. 


45. Determine the nature of the critical points of 
f(x, y) =xy+1/x+8/y. 


on aD. This is sometimes called the “weak minimum 
principle” for harmonic functions. 


50. Let: 3D — R be continuous and let T be a solution 
on D to VT = 0, continuous on D and T = ¢ on ðD. 


(a) Use Exercises 46 to 49 to show that such a solution, 
if it exists, must be unique. 


(b) Suppose that T (x, y) represents a temperature 
function that is independent of time, with @ 
representing the temperature of a circular plate at its 
boundary. Can you give a physical interpretation of 
the principle stated in part (a)? 


5L (a) Let f beaC! function on the real line R. Suppose 
that f has exactly one critical point Xo that is a strict 
local minimum for f. Show that xo is also an 
absolute minimum for f; thatis, that f(x) > f (xo) 
forall x. 


(b) The next example shows that the conclusion of part 
(a) does not hold for functions of more than one 
variable. Let f: R? — R be defined by 


~ x2 


f(x,y) =—y*—e-* + 2y? Vex +e”, 
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(i) | Show that (0, 0) is the only critical point for f 
and that it is alocal minimum. 

(ii) Argue informally that f has no absolute 
minimum. 


52. Suppose that a pentagon is composed of a rectangle 
topped by an isosceles triangle (see Figure 3.3.8). If the 
length of the perimeter is fixed, find the maximum 
possible area. 


figure 3.3.8 Maximize the area for fixed 
perimeter. 


3.4 Constrained Extrema and Lagrange Multipliers 


Often we are required to maximize or minimize a function subject to certain constraints 
or side conditions. For example, we might need to maximize f(x, y) subject to the 
condition that x? + y? = 1; that is, that (x, y) lie on the unit circle. M ore generally, we 
might need to maximize or minimize f(x, y) subject to the side condition that (x, y) 
also satisfies an equation g(x, y) = c, where g is some function and c equals a constant 
[in the preceding example, g(x, y) = x? + y?, and c = 1]. The set of such (x, y) isa 
level curve for g. 

The purpose of this section is to develop some methods for handling this sort of 
problem. In Figure 3.4.1 we picture a graph of a function f(x, y). In this picture, the 
maximum of f might be at (0, 0). However, suppose we are not interested in this 
maximum but only the maximum of f(x, y) when (x, y) belongs to the unit circle; that 
is, when x? +y? = 1. Thecylinder over x?+ y? = 1 intersects the graph of z = f(x, y) 
in a curve that lies on this graph. The problem of maximizing or minimizing f(x, y) 
subject to the constraint x? + y? = 1 amounts to finding the point on this curve where 
z is the greatest or the least. 


The Lagrange Multiplier Method 


In general, let f:U CIR" + Randg:U c R" —> R begiven C t functions, and let S be 
the level set for g with value c [recall that this is the set of points x € R” with g(x) = c]. 


Z 


4¿=t{x, y) subject 
to the constraint 
x? + y? =1 


figure 3.4.1 The geometric meaning 
of maximizing f subject to the 
constraint x? + y? = 1. 


1 
I 
i 
1 
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Point on x*+y2=1 
where t is maximized 
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Higher-Order Derivatives: Maxima and Minima 


When f is restricted to S we again have the notion of local maxima or local minima 
of f (local extrema), and an absolute maximum (largest value) or absolute minimum 
(smallest value) must be a local extremum. The following method provides a necessary 
condition for a constrained extremum: 


Theorem 8 The Method of Lagrange Multipliers Suppose that f: U c 
R” + Randg: U c R” — Rare given C1 real-valued functions. Let % € U 
and g(%)) = c, and let S be the level set for g with value c [recall that this is the 
set of points x€ R” satisfying g(x) = c]. Assume Vg (%) 4 O 

If f |S, which denotes “if restricted to S,” has a local maximum or minimum 
on S at %, then there is a real number à (which might be zero) such that 


Vf (x) = AVG(%). (1) 


In general, a point % Where equation (1) holds is said to be a critical point 
of f|S. 


proof We have not developed enough techniques to give a complete proof, but we 
can provide the essential points. (The additional technicalities needed are discussed in 
Section 3.5 and in the Internet supplement.) 

In Section 2.6 we learned that for n = 3 the tangent space or tangent plane of S at 
X is the space orthogonal to Vg(%). For arbitrary n we can give the same definition for 
the tangent space of S at X. This definition can be motivated by considering tangents 
to paths dt) that liein S, as follows: If dt) is a path in S and d0) = x, then ¢(0) isa 
tangent vector to S at X, but 


d d 
T = Ge =o 


and, on the other hand, by the chain rule, 


dt 
so that Vg (xo) - ¢(0) = 0; that is, c (0) is orthogonal to Vg (xo). 


If f IS has a maximum at x, then f(dt)) has a maximum att = 0. By one-variable 
calculus, df (c{t)) /dt|t2o = 0. Hence, by the chain rule, 


d 
~ dt 
Thus, Vf (Xo) is perpendicular to the tangent of every curvein S and so is perpendicular 
to the whole tangent space to S at X. Because the space perpendicular to this tangent 
space is a line, Vf(%) and Vg(%) are parallel. Because Vg(%) + Q it follows that 
Vf (Xo) isa multiple of Vg(x%), which is the conclusion of the theorem. m 


0 F(dt))| = vf (%) - c(0). 


t=0 


Let us extract some geometry from this proof. 


Theorem 9 If f, when constrained to a surface S, has a maximum or minimum 
at X, then Vf (xX) is perpendicular to S at X (see Figure 3.4.2). 


figure 3.4.2 The geometry of 
constrained extrema. 
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# gradt(xo, yo Zo) = Vt (Xo, Yor 20) 


Plane tangent to S 


Surface S 


These results tell us that in order to find the constrained extrema of f, we must 
look among those points X satisfying the conclusions of these two theorems. We 
shall give several illustrations of how to use each. 

When the method of Theorem 8 is used, we look for a point X and a constant 
à, called a Lagrange multiplier, such that Vf (xX) = AVg(X). This method is more 
analytic in nature than the geometric method of Theorem 9. Surprisingly, Euler intro- 
duced these multipliers in 1744, some 40 years before Lagrange! 

Equation (1) says that the partial derivatives of f are proportional to those of g. 
Finding such points X at which this occurs means solving the simultaneous equations 


of ag 

oe sue Mil E ah weer Xn) 

of ag 

—( 1, Xn) = A—(X1, Xn) 

on 3X2 (2) 
af ag 

ay, Ol Xn) =A e Xn) 

(Xir... Xn) =C 


fOr X1, ..., Xn anda. 
Another way of looking at these equations is as follows: Think of à as an additional 


variable and form the auxiliary function 
W(X1,.--, Xn, A) = f (Xi... Xn) — ALQ(Xa, -3 Xn) — C]. 


The Lagrange multiplier theorem says that to find the extreme points of f |S, we should 
examine the critical points of h. These are found by solving the equations 


OXn OXn OXn 
ah 
= — =9(X1,.-., Xn) — 
0 DA g( 1 n) c 


which are the same as equations (2) above. 
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Higher-Order Derivatives: Maxima and Minima 


example | 


solution 


example 2 


solution 


Second-derivative tests for maxima and minima analogous to thosein Section 3.3 will 
be given in Theorem 10 later in this section. H owever, in many problems itis possible to 
distinguish between maxima and minima by direct observation or by geometric means. 
Because this is often simpler, we consider examples of the latter type first. 


Let S c R? be the line through (—1, 0) inclined at 45°, and let f: R? > R, (x, y) => 
x? + y?. Find the extrema of f |S. 


Here S = {(x,y) | y —x —1 = 0}, and therefore we set g(x,y) = y—x-1 
and c = 0. We have Vg(x, y) = —i + j 4 O The relative extrema of f|S must be 
found among the points at which Vf is orthogonal to S; that is, inclined at —45°. But 
Vf (x, y) = (2x, 2y), which has the desired slope only when x = —y, or when (x, y) 
lies on the line L through the origin inclined at —45°. This can occur in the set S only 
for the single point at which L and S intersect (see Figure 3.4.3). Reference to the level 
curves of f indicates that this point, (—1/2, 1/2), is a relative minimum of f |S (but 
not of f). 
Notice that in this problem, f on S has a minimum but no maximum. 


figure 3.4.3 The geometry associated with finding the 
extrema of f(x, y) = x? + ¥ restricted to 
S={%YWly-x-1=0}. A 


Let f: R? > R, (x, y) & x? — y?, and let S bethe circle of radius 1 around the origin. 
Find the extrema of f |S. 


The set S is the level curve for g with value 1, where g: R? > R, (x, y) œ> Xx? + y?. 
Because both of these functions have been studied in previous examples, we know their 
level curves; these are shown in Figure 3.4.4. In two dimensions, the condition that 
Vf = Avg at x— that is, that Vf and Vg are parallel at x)— is the same as the level 
curves being tangent at X (why?). Thus, the extreme points of f|S are (0, +1) and 
+1, 0). Evaluating f, we find (0, +1) are minima and (1, 0) are maxima. 

Let us also do this problem analytically by the method of Lagrange multipliers. 
Clearly, 


~ 


vf(x,y)= (> =) =(2x,—2y) and Vg(x, y) = (2x, 2y). 


example 3 


solution 
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figure 3.4.4 The geometry associated with the 
problem of finding the extrema of x? — y? on 
Sax Yl e+y =}. 


Note that Vg(x, y) # Oif x? + y? = 1. Thus, according to the Lagrange multiplier 
theorem, we must find a A such that 


(2x, —2y) = A(2x,2y) and (x,y)eS, ie, x*+y?=1. 


These conditions yield three equations, which can be solved for the three unknowns 
X, y, and A. From 2x = 42x, we conclude that either x = 0 or à = 1. If x = 0, then 
y = +1 and —2y = A2y implies à = —1. If à = 1, then y = 0 and x = +1. Thus, 
we get the points (0, +1) and (+1, 0), as before. As we have mentioned, this method 
only locates potential extrema; whether they are maxima, minima, or neither must be 
determined by other means, such as geometric arguments or the second-derivative test 
given below.2 A 


M aximize the function f(x, y, z) = x + z subject to the constraint x? + y? + z? = 1. 


By Theorem 7 weknow that the function f restricted to the unit sphere x? + y? +z? =1 
has a maximum (and also a minimum). To find the maximum, we again use the L agrange 
multiplier theorem. We seek à and (x, y, z) such that 


1=2xA, 0=2yà, and 1=22), 
and 
xy? 427 = 1. 


From the first or the third equation, we see that à + 0. Thus, from the second equation, 
we get y = 0. From the first and third equations, x = z, and so from the fourth, 
x = +1//2 = z. Hence, our points are (1/./2, 0, 1/./2) and (—1//2, 0, —1/./2). 
Comparing the values of f at these points, we can see that the first point yields the 
maximum of f (restricted to the constraint) and the second the minimum. A 


?In these examples, Vg(Xo) # Oon the surface S, as required by the Lagrange multiplier theorem. If 
Vg(%) were zero for some x on S, then it would have to be included among the possible extrema. 
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Higher-Order Derivatives: Maxima and Minima 


example 4 


solution 


Assume that among all rectangular boxes with fixed surface area of 10 square meters 
there is a box of largest possible volume. Find its dimensions. 


If x, y,andz arethelengths of thesides, x > 0, y > 0, z > 0, respectively, and the volume 
is f(x, y, Z) = xyz. The constraintis 2(xy +xz + yz) = 10; thatis, xy +xz + yz = 5. 
Thus, the Lagrange multiplier conditions are 


yz =A(y +2) 
XZ = A(X +2) 
xy =A(y +x) 


Xy+xXZ+yz=5. 


First of all, x + 0, because x = 0 implies yz = 5 and 0 = åz, so that à = 0 and we 
get the contradictory equation yz = 0. Similarly, y 4 0, z 40,x+y 40. Elimination 
of à from the first two equations gives yz/( y + z) = xz/(x + z), which gives x = y; 
similarly, y = z. Substituting these values into the last equation, we obtain 3x? = 5, 
or x = „5/3. Thus, we get the solution x = y = z = /5/73, and xyz = (5/3)3”. 
This (cubical) shape must therefore maximize the volume, assuming there is a box of 
maximum volume. A 


Existence of Solutions 


We should note that the solution to Example 4 does not demonstrate that the cube is 
the rectangular box of largest volume with a given fixed surface area; it proves that the 
cube is the only possible candidate for a maximum. We shall sketch a proof that it really 
is the maximum later. The distinction between showing that there is only one possible 
solution to a problem and that, in fact, a solution exists is a subtle one that many (even 
great) mathematicians have overlooked. 

Queen Dido (ca. 900 B.c.) realized that among all planar regions with fixed circum- 
ference, the disc is the region of maximum area. Itis not difficult to prove this fact under 
the assumption that there is a region of maximum area; however, proving that such a 
region of maximum area exists is quite another (difficult) matter. A complete proof was 
not given until the second half of the nineteenth century by the German mathematician 
Weierstrass. 

L et us consider a nonmathematical parallel to this situation. Put yourself in the place 
of Lord Peter Wimsey, Dorothy Sayers’ famous detective: 


“Undoubtedly,” said Wimsey, “but if you think that this identification is going to make life 
one grand, sweet song for you, you are mistaken. ... Since we have devoted a great deal of 
time and thought to the case on the assumption that it was murder, it’s a convenience to 
know that the assumption is correct.” 10 


Wimsey has found the body of a dead man, and after some time has located ten 
suspects. H eis sure that no one else other than one of the suspects could be the murderer. 
By collecting all the evidenceand checking alibis, hethen reduces the number of suspects 
one by one, until, finally, only the butler remains; hence, he is the murderer! But wait, 
Peter is a very cautious man. By checking everything once again, he discovers that the 
man died by suicide; so there is no murder. You see the point: It does not suffice to find 
a clear and uniquely determined suspect in a criminal case where murder is suspected; 
you must prove that a murder actually took place. 

The same goes for our cube; the fact that it is the only possible candidate for a max- 
imum does not prove that it is maximum. (For more information see The Parsimonious 


10D orothy L. Sayers, Have His Carcase, Chapter 31: The Evidence of the Haberdasher’s A ssistant, 
New York, Avon Books, 1968, p. 312. 


example 5 
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Universe: Shape and Form in the Natural World, by S. Hildebrandt and A. Tromba, 
Springer-Verlag, New York/Berlin, 1995.) 

The key to showing that f(x, y, z) = xyz really has a maximum lies in the fact that 
f isacontinuous function that is defined on the unbounded surface S: xy +xz+yz = 5, 
and not on abounded set, which includes its boundary, where Theorem 7 of Section 3.3 
would apply. We have already seen problems of this sort for functions of one and two 
variables. 

The way to show that f(x, y,z) = xyz > 0 does indeed have a maximum on 
xy + yz + Xz = 5 is to show that if x, y, or z tend to oo, then f(x, y,z) > 0. We 
may then conclude that the maximum of f on S must exist by appealing to Theo- 
rem 7 (you should supply the details). So, suppose (x, y, z) lies in S and x —> co; 
then y — 0 and z — 0 (why?). Multiplying the equation defining S by z, we 
obtain the equation xyz + xz? + yz? = 5z > 0 asx — oo. Because x, y,z > 0, 
xyz = f(x, y,z) — 0. Similarly, f(x, y, z) — 0 if either y or z tend to oo. Thus, a 
box of maximum volume must exist. 

Some general guidelines may be useful for maximum and minimum problems with 
constraints. First of all, if the surface S is bounded (as an ellipsoid is, for example), then 
f musthave a maximum and a minimum on S. (SeeTheorem 7 in the preceding section.) 
In particular, if f has only two points satisfying the conditions of the Lagrange multiplier 
theorems or Theorem 9, then one must be a maximum and one must be a minimum. 
Evaluating f at each point will tell the maximum from the minimum. However, if there 
are more than two such points, some can be saddle points. Also, if S is not bounded (for 
example, if itis a hyperboloid), then f need not have any maxima or minima. 


Several Constraints 
If a surface S is defined by a number of constraints, namely, 


Qi(X1,..-, Xn) = C1 
al coer Xn) = C2 l (4) 
Oi(Xay.+ +1 Xn) = Ck 


then the L agrange multiplier theorem may be generalized as follows: If f hasa maximum 
or a minimum at % on S, there must exist constants A1, ..., Ax such that! 


VE (0) = A179100) + + + A VOK(G). (5) 


This case may be proved by generalizing the method used to prove the L agrange mul- 
tiplier theorem. Let us give an example of how this more general formulation is used. 


Find the extreme points of f(x,y,z) = x + y +z subject to the two conditions 
x* +y? =2andx+z=1. 


Here there are two constraints: 


gi(X, y,z) =x*+y?—2=0 and g(x, y,z)=x+z-1=0. 


11A s with the hypothesis Vg(%) 4 Oin the Lagrange multiplier theorem, here we must assume that the 
vectors V(X), -.-, Vi (%X) arelinearly independent; thatis, each Vg; (X) isnota linear combination 
of the other Vgj(Xo), j i. 
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Higher-Order Derivatives: Maxima and Minima 


example 6 


solution 


Thus, we must find x, y, Z, Ai, and A2 such that 


VE (x, Y, Z) = Ar Vgu(X, Y, Z) + A2Vg2(X, Y, Z), 
gi(x, y,z) =0, and g2(x, y,z) =0. 


Computing the gradients and equating components, we get 


1=A,-2x+A.-1, 
1=),-2y+2°0, 
1 = 1:0 +å2:1, 

x? +y? =2, and x+z=1. 


These are five equations for x, y, Z, Ai, and Az. From the third equation, 42 = 1, and 
so 2XA, = 0, 2yåı = 1. Because the second implies 4; 4 0, we have x = 0. Thus, 
y = +2 and z = 1. Hence, the possible extrema are (0, +./2, 1). By inspection, 
(0, 2, 1) gives a relative maximum, and (0, —/2, 1) a relative minimum. 

The condition x? + y? = 2 implies that x and y must be bounded. The condition 
x +z = 1 implies that z is also bounded. If follows that the constraint set S is closed 
and bounded. By Theorem 7 it follows that f has a maximum and minimum on S that 
must therefore occur at (0, /2, 1) and (0, —/2, 1), respectively. A 


The method of Lagrange multipliers provides us with another tool to locate the 
absolute maxima and minima of differentiable functions on bounded regions in IR? (see 
the strategy for finding absolute maximum and minimum in Section 3.3). 


Find the absolute maximum of f(x, y) = xy on the unit disc D, where D is the set of 
points (x, y) with x? + y? < 1. 


By Theorem 7 in Section 3.3, we know the absolute maximum exists. First, we find all 
the critical points of f in U , the set of points (x, y) with x? + y? < 1. Because 


af d af 

TA y an 7a X, 
(0, 0) is the only critical point of f in U . Now consider f on the unit circle, the level 
curve g(x, y) = 1, where g(x, y) = x? + y?. To locate the maximum and minimum 
of f on C, we write down the Lagrange multiplier equations: Vf (x, y) = (y, x) = 
AVg(x, y) = A(2x, 2y) and x? + y? = 1. Rewriting these in component form, we get 


y = 22x, 
X= 2ary, 
xX +y? =l. 
Thus, 
y =4y, 
or A = 1/2 and y = +x, which means that x? + x? = 2x? = 1 or x = 41//2, 
y = +1//2. On C we compute four candidates for the absolute maximum and mini- 
mum, namely, 


TE g Ged 
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solution 


3.4 Constrained Extrema and Lagrange Multipliers 193 


The value of f at both (—1//2, —1/./2) and (1//2, 1//2) is 1/2. The value of f at 
(—1//2, 1/2) and (1/./2, —1/,/2) is —1/2, and the value of f at (0, 0) is 0. Therefore, 
the absolute maximum of f is 1/2 and the absolute minimum is —1/2, both occurring 
onC.At(0, 0), a2f/ax* = 0, a2f /ay? = 0 and a2f /ax ay = 1, so the discriminant is 
—land thus (0, 0) is a saddle point. A 


Find the absolute maximum and minimum of f(x, y) = 5x? + ży? in the elliptical 
region D defined by 5x? + y? <1. 


Again by Theorem 7, Section 3.3, the absolute maximum exists. We first locate the 
critical points of f in U , the set of points (x, y) with 5x? + y? < 1. Because 


af of 


ax Oya 


y, 


the only critical point is the origin (0, 0). 

We now find the maximum and minimum of f on C , the boundary of U , which is the 
level curve g(x, y) = 1, where g(x, y) = ix? +y?. The Lagrange multiplier equations 
are 


vf (x, y) = (x, y) = AVg(x, y) = A(X, 2y) 


and (x?/2) + y? = 1. In other words, 


X = 2X 
x? j 
— = 1 
I 


If x = 0, then y = +1 and à = 4. If y = 0, then x = +v2 and à = 1. If x # 0 
and y # 0, we get both A = 1 and 1/2, which is impossible. Thus, the candidates 
for the maxima and minima of f on C are (0, +1), (v2, 0) and for f inside D, the 
candidate is (0, 0). The value of f at (0, +1) is 1/2, at (+./2, 0) it is 1, and at (0, 0) 
itis 0. Thus, the absolute minimum of f occurs at (0, 0) and is 0. The absolute maximum 
of f on D is thus 1 and occurs at the points (+/2,0). A 


Global Maxima and Minima 


The method of Lagrange multipliers enhances our techniques for finding global maxima 
and minima. In this respect, the following is useful. 


Definition LetU bean open region in R” with boundary aU . We say that aU 
is smooth if dU is the level set of a smooth function g whose gradient Vg never 
vanishes (i.e., Vg 4 0). Then we have the following strategy. 
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Lagrange Multiplier Strategy for Finding Absolute Maxima and 
Minima on Regions with Boundary Let f beadifferentiable function on 
a closed and bounded region D = U u 0U,U open in R”, with smooth 
boundary aU . 

To find the absolute maximum and minimum of f on D: 


(i) Locate all critical points of f inU. 


(ii) Use the method of Lagrange multiplier to locate all the critical points of 
f|au. 


(iii) Compute the values of f at all these critical points. 


(iv) Select the largest and the smallest. 


Find the absolute maximum and minimum of the function f(x,y,z) = 
x? +y? +z? — x +yontheset D = {(x, y, z) | X? + yY? +z? < 1}. 


As in the previous examples, we know the absolute maximum and minimum exists. 
Now D =U U 3U , where 


U = {(x, y, Zz) |X? +y? +z? <1} 
and 
aU ={(x, y, Z) |X? + yY? +z? = 1}. 


Vf (x, y, z) = (2x — 1, 2y + 1, 22). 

Thus, Vf = 0 at(1/2, —1/2, 0) which is in U , the interior of D . 

Letg(x, y, Z) =x*+y*+z*. Then au isthelevel set g(x, y, z) = 1. B y the method 
of Lagrange multipliers, the maximum and minimum must occur at a critical point of 
f |aU ; that is, at a point x where Vf (x) = AVg(X) for some scalar A. 

Thus, 


(2x — 1, 2y + 1, 2z) = A(2x, 2y, 2z) 


or 
(i) 2x — 1 = 2Ax 
(ii) 2y +1 = 2ay 
(iii) 2z = 2Az 


If A = 1, then we would have 2x — 1 = 2x or —1 = 0, which is impossible. 
We may assume that à + 0 since if à = 0, we only get an interior point as above. 
Thus (iii) implies that z = 0 and 


(iv) x? +y? =1. 

Solving (i) and (ii) for x and y we find, 
(v) x =1/2(1—A) 
(vi) y = —1/2(1—A) 


example 9 


solution 
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Applying (iv) we can solve for A, namely A = 1+ (1/./2). Thus, from (v) and (vi) we 
have that x = +(1/,/2) and y = +(1/,/2); that is, we have four critical points on aU . 
Evaluating f at each of these points, we see that the maximum value for f on dU is 
1+4+2//2 = 1+ V2 and the minimum value is 1 — /2. The value of f at (1/2, —1/2) 
is —1/2. Comparing these values, noting that —1/2 < 1— V2, we see that the absolute 
minimum is —1/2, occurring at (1/2, —1/2), and that absolute maximum is 1 + ./2, 
occurring at (—1/./2,1/./2). 4 


Two Additional Applications 
We now present two further applications of the mathematical techniques developed in 
this section to geometry and to economics. We shall begin wth a geometric example. 
Suppose we have a curve defined by the equation 

p(x, y) = Ax?+2Bxy+Cy?-1=0. 


Find the maximum and minimum distance of the curve to the origin. (These are the 
lengths of the semimajor and the semiminor axis of this quadric.) 


The problem is equivalent to finding the extreme values of f(x, y) = x? + y? subject 
to the constraining condition @(x, y) = 0. Using the Lagrange multiplier method, we 
have the following equations: 


2x +A(2Ax + 2By) =0 (6) 
2y + A(2Bx + 2C y) =0 (7) 
Ax? +2Bxy+Cy? =1. (8) 


Adding x times equation (6) to y times equation (7), we obtain 
2(x? + y?) + 2a(Ax? + 2Bxy + Cy’) =0. 


By equation (8), it follows that x? + y? +A = 0. Lett = —1/A = 1/(x? + y?) [the case 
à = 0 is impossible, because (0, 0) is not on the curve ¢(x, y) = 0]. Then equations 
(6) and (7) can be written as follows: 


2(A —t)x + 2By =0 


(9) 
2Bx +2(C —t)y =0. 


If these two equations are to have a nontrivial solution [remember that (x, y) = (0, 0) 
is not on our curve and so is not a solution], it follows from a theorem of linear algebra 
that their determinant vanishes:1? 


iA B 


B alat 


Because this equation is quadratic in t, there are two solutions, which we shall call tı 


and tz. Because —A = x? + y?, we have \/x2+ y? = ./—2. Now ,/x2-+ y? is the 


12T he matrix of coefficients of the equations cannot have an inverse, because this would imply that the 
solution is zero. Recall that a matrix that does not have an inverse has determinant zero. 
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solution 


distance from the point (x, y) to the origin. Therefore, if (xı, y1) and (x2, y2) denote 
the nontrivial solutions to equation (9) corresponding to tı and tz, and if tı and tz are 
positive, we get /x + y3 = 1//& and ,/x? + y? = 1/,/f. Consequently, if ti > t, 
the lengths of the semiminor and semimajor axes are 1/,/t; and 1/,/t;, respectively. If 
the curve is an ellipse, both tı and tz are, in fact, real and positive. W hat happens with 
a hyperbola or a parabola? A 


Finally, we discuss an application to economics. 


Suppose that the output of a manufacturing firm is a quantity Q of a certain product, 
where Q isafunction f (K, L), where K is the amount of capital equipment (or invest- 
ment) and L is the amount of labor used. If the price of labor is p, the price of capital 
is q, and the firm can spend no more than B dollars, how can we find the amount of 
capital and labor to maximize the output Q ? 


We would expect that if the amount of capital or labor is increased, then the output Q 
should also increase; that is, 


aQ dQ 
K! and a oe 


We also expect that as more labor is added to a given amount of capital equipment, we 
get less additional output for our effort; that is, 


a’Q 


lz <0 


Similarly, 


a°Q 


With these assumptions on Q, it is reasonable to expect the level curves of out- 
put (called isoquants) Q(K,L) = c to look something like the curves sketched in 
Figure 3.4.5, with C1 < C2 < C3. 


LL 


figure 3.4.5 What is the largest value of Q in the shaded triangle? 
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We can interpret the convexity of the isoquants as follows: As we move to the right 
along a given isoquant, it takes more and more capital to replace a unit of labor and 
still produce the same output. The budget constraint means that we must stay inside the 
triangle bounded by the axes and the line pL +qK = B. Geometrically, itis clear that 
we produce the most by spending all our money in such a way as to pick the isoquant 
that just touches, but does not cross, the budget line. 

Because the maximum point lies on the boundary of our domain, we apply the 
method of Lagrange multipliers to find the maximum. To maximize Q = f(K,L) 
subject to the constraint pL + qK = B, we look for critical points of the auxiliary 
function, 


h(K,L,a) = f(K,L) —a(pL +qK —B). 


Thus, we want 


a = aa, aD is and pL +qKk =B. 


These are the conditions we must meet in order to maximize output. (You are asked to 
work out a specific case in Exercise 36.) A 


In the preceding example, à represents something interesting. Let k = qK and 
| = pL, so that k is the dollar value of the capital used and | is the dollar value of the 
labor used. Then the first two equations become 


aQ _ 10 13Q AQ 
ak ~~ q aK pəl al” 


Thus, atthe optimum production point the marginal changein output per dollar’s worth of 
additional capital investment is equal to the marginal change of output per dollar’s worth 
of additional labor, and A is this common value. At the optimum point, the exchange 
of a dollar's worth of capital for a dollar's worth of labor does not change the output. 
Away from the optimum point the marginal outputs are different, and one exchange or 
the other will increase the output. 


A Second-Derivative Test for Constrained Extrema 


In Section 3.3 we developed a second-derivative test for extrema of functions of 
several variables by looking at the second-degree term in the Taylor series of f. If 
the Hessian matrix of second partial derivatives is either positive-definite or negative- 
definite at a critical point of f, this point is a relative minimum or maximum, 
respectively. 

The question naturally arises as to whether there is a second-derivative test for max- 
imum and minimum problems in the presence of constraints. The answer is yes and the 
test involves a matrix called a bordered Hessian. We will first discuss the test and how 
to apply it for the case of a function f(x, y) of two variables subject to the constraint 
g(x, y) =C. 
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Theorem 10 Let f:U c R? > Randg:U c R? —> R be smooth (at 
least C *) functions. Let w € U, g(w) = c, and S be the level curve for g with 
value c. Assume that Vg(w) = Oand that there is a real number à such that 
Vf (Wp) = AVg(Wo). Form the auxiliary function h = f — Ag and the bordered 
Hessian determinant 


oO A 
ax ay 
ag ah an 


evaluated at w. 


əx əx? ax day 
ag əh 3 h 
əy axoy əy? 


(i) If |H | > 0, then w isa local maximum point for f |S. 
(ii) If |H | < 0, then w isa local minimum point for f |S. 
(iii) If |H] = 0, the test is inconclusive and w may be a minimum, a maximum, 
or neither. 


This theorem is proved in the Internet supplement for this section. 


example 11 Find extreme points of f (x, y) = (x — y)” subject to the constraint x? + y? = 1, where 
n>1. 


solution We set the first derivatives of the auxiliary function h defined by h(x, y,a) = 
(x — y)” — A(x? + y? — 1) equal to 0: 
n(x — y)" —2ax =0 
—n(x — y)""!— 2ay = 0 
(x? +y’ - 1) =0. 
From the first two equations we see that A(x + y) = 0. If à = 0, then x = y =+,/2/2. 
If à Æ 0, then x = —y. The four critical points are represented in Figure 3.4.6, and the 
corresponding values of f(x, y) are listed below: 


(A) x=V2/2 y=v2/2 À 


ll 
oO 


f(x, y)=0 


figure 3.4.6 The four critical points in Example 11. 


3.4 Constrained Extrema and Lagrange Multipliers 199 


(B) x=/2/2 y=—2/2 =n 
(C) x=-—42/2 y=-V2/2 1=0 f(x, y)= 
(D) x=-V2/2 y=V/2/2 X 
By inspection, we see that if n is even, then A and C are minimum points and B and 
D are maxima. If n is odd, then B is a maximum point, D is a minimum, and A and C 


are neither. L et us see whether Theorem 10 is consistent with these observations. 
The bordered Hessian determinant is 


0 —2x —2y 
|] =|—2x n(n —1)(x — y)"™? — 2a n(n — 1)(x — y)"=? 
2y =n(n=1)(x =y)' n(n = 1)(x — yy)? = 23, 


= —4n(n — 1)(x — y)’-?(x + y)? + 8A(x? — y2). 


Ifn =lorifn >3,|H| =0atA,B,C, andD.Ifn = 2, then |H | = 0 atB and D and 
—16 atA and C. Thus, the second-derivative test picks up the minima at A and C, but 
is inconclusive in testing the maxima at B and D for n = 2. It is also inconclusive for 
all other valuesofn. A 


J ust as in the unconstrained case, there is also a second-derivative test for functions 


of more than two variables. If we are to find extreme points for f (X1, ..., Xn) Subject 
to a single constraint g(X1,..., Xn) = c, we first form the bordered Hessian for the 
auxiliary function h(xX1,..., Xn) = f(X1,...,Xn) — A(g(X1, ..., Xn) — C) as follows: 
0 —99 a 43 Og 
OX1 0X2 OXn 
—dg əh ah a7h 
0X1 3x? 0X1 OX2 d 0X1 OXn 
—əg əh ah a7h 
8X2 9X, 9X2  ƏxX?  IX2ðXn 
—dg arth ah a7h 
Xn 9X1 OXn ƏX2ðXn ðX? 


Second, we examine the determinants of the diagonal submatrices of order >3 at the 
critical points of h. If they are all negative, that is, if 


0 ag ag ag 
0 _ 99° 3g 0X1 0X2 0X3 
0X1 OX2 ag ah a*h a*h 
ag a?h a?h OX. 3X?  əX1əðX2 ə3X13X3 
ax ax? axa) ~ i ag a?h a?h ah | ~ a 
ag a*h a¢h OX. OX, 3X2 3x? 3X2 3X3 
OX. ə3X1əðX2 OXF. ag ah a7h a7h 
3X3 OX, 9X3 9X2 9X3. OXF 
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then we are at a local minimum of f |S. If they start out with a positive 3 x 3 subde- 
terminant and alternate in sign (that is, >0, <0, >0, <0, ...), then we are at a local 
maximum. If they are all nonzero and do not fit one of these patterns, then the point is 
neither a maximum nor a minimum (it is said to be of the saddle type). 


example 12 | Study the local extreme points of f(x, y, z) = xyz on the surface of the unit sphere 
x? + y? + z? = 1 using the second-derivative test. 


solution Setting the partial derivatives of the auxiliary function h(x, y, z, 4) = xyz — A(x? + 
y? + z? — 1) equal to zero gives 


yz = 2AXx 
XZ = 21y 
Xy = 22 


xX +y? +z =l. 


Thus, 3xyz = 2A(x? + y? + z?) = 2a. If A = 0, the solutions are (x, y, z, à) = 
+1, 0, 0, 0), (0, +1, 0, 0),and (0, 0, +1, 0). If à 4 0, thenwehave2a = 3xyz = 612z, 
and so z? = 4. Similarly, x? = y? = $. Thus, the solutions are given by à = 3xyz = 
+,/3/6.The critical points of h and the corresponding values of f are given in Table 3.1. 
From it, we see that points E, F, G, and K are minima. Points D, H, 1, and J are maxima. 
To see whether this is in accord with the second-derivative test, we need to consider two 
determinants. First, we look at the following: 


~ 


0 —dg/dx  —əg/əðy 0 —2x —2y 
|>|=|—ag/ax a*h/ax? a2/axay|=|—2x -2a z 
—ag/ay azh/ax ay azh/ay? =2y z =2h 


= BAX? + Bay? + Bxyz = BAX? + y? + 222). 


Observe that sign (|H >|) = sign A = sign (xyz), where the sign of a number is 1 if that 
number is positive, or is —1 if that number is negative. Second, we consider 


0 —dg/ax —dg/day —dg/dz 
—ag/ax  ah/ax* = a*h/ax ay a*h/ax az 
—ag/ay a*h/axay a*h/ay* = a2h /ay az 
—ag/dz ah/axaz a*h/ayaz  aeh/az? 

0 -2x —2y -2z 
—2x -2r 2 y 
—2y z -2 x |’ 
—2z y X —2A 


IH 3| 


13For a detailed discussion, see C. Caratheodory, Calculus of Variations and Partial Differential 
Equations, Holden-Day, San Francisco, 1965; Y. Murata, Mathematics for Stability and Optimiza- 
tion of Economic Systems, Academic Press, New York, 1977, pp. 263-271; or D. Spring, Am. Math. 
Mon. 92 (1985): 631-643. 
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table 3.] The critical points A, B,..., J, K of h and corresponding values of f 
X y Z À f(x, y, Z) 
+A +1 0 0 0 0 
+B 0 +1 0 0 0 
+C 0 0 +1 0 0 
D /3/3 3/3 3/3 V3/6 3/9 
E —/3/3 J3/3 J3/3 —/3/6 —/3/9 
F V3/3 -/3/3 3/3 —/3/6 -/3/9 
G V3/3 J3/3 -J/3/3 —/3/6 —/3/9 
H J3/3 -J/3/3 —J/3/3 J3/6 V3/9 
| —/3/3 3/3 -/3/3 V3/6 3/9 
J —/3/3 —/3/3 3/3 /3/6 J3/9 
K —/3/3 —/3/3 -/3/3 —/3/6 —/3/9 


which works out to be +4 at points +A, +B, and +C and -# at the other eight points. 
AtE, F, G, and K, we have |H 2| < 0 and |H 3| < 0, and so the test indicates these are 
local minima. At D, H, |, and J we have |H 2| > 0 and |Ħ 3] < 0, and so the test says 


these are local maxima. Finally, the second-derivative test shows that +A , +B, and +C 


are saddle points. A 


exercises 


L Let f(x, y) =x? + 3y?. Find the maximum and 
minimum values of f subject to the given constraint. 


(a) x? +y? =1 
(b) x? +y? <1 


In Exercises 3 to 7, find the extrema of f subject to the stated constraints. 
3 f(x,y,z) =x — y +z, subject to x? + y? +z? = 2 
4 f(x, y) =x — y, subject to x? — y? = 2 
5. f(x, y) =x, subject to x? + 2y? = 3 
Find the relative extrema of f |S in Exercises 8 to 11. 
8 f: R? >R, (x, y) = x? + y?, S = {(x, 2) |x € R} 
9. f: R? >R, (x, y) x? +y4,5 ={(% y) ly > 2} 


10. f: R? —> R, (x, y) œ> x?—y?, S = {(x, cosx) | X € R} 


IL f: R? >R, (x, y, z) > x? + y? +z, S = {(x, y, 2) | 
Z>24+x? +y?} 


I2. Use the method of Lagrange multipliers to find the 
absolute maximum and minimum values of 


2 


6. 


Consider all rectangles with fixed perimeter p. Use 
Lagrange multipliers to show that the rectangle with 
maximal area is a square. 


f(x, y, z) =x + y + z, subject to 
x? — y? =1,2x +z =1 


. f(x, y) = 3x + 2y, subject to 2x? + 3y? = 3 


f(x, y) =X? + y? — x — y + 1 on the unit disc (see 
Example 10 of Section 3.3). 


Consider the function f (x, y) = x? + xy + y? defined 
on the unit disc, namely, D = {(x, y) | x? + y? < 1}. 
Use the method of Lagrange multipliers to locate the 
maximum and minimum points for f on the unit circle. 
Use this to determine the absolute maximum and 
minimum values for f on D. 
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14. Find the absolute maximum and minimum values of 


17. 


24. 
25. 


26. 


f(x, yY, Zz) = 2x + y, subject to the constraint 
X+y+z=1. 


Find the extrema of f(x, y) = 4x + 2y, subject to the 
constraint 2x2 + 3y? = 21. 


Use Lagrange multipliers to find the distance from the 
point (2, 0, —1) to the plane 3x — 2y +82+1=0. 
Compare your answer to Example 12 in Section 1.3. 


Find the maximum and minimum values attained by 
f(x, y, z) = xyz on the unit ball x? + y? +z? < 1. 


Let S be the sphere of radius 1 centered at (1, 2, 3). Find 
the distance from S to the plane x + y +z = 0. (HINT: 
Use Lagrange multipliers to find the distance from the 
plane to the center of the sphere.) 


(a) Find three numbers whose product is 27 and whose 
sum is minimal. 


(b) Find three numbers whose sum is 27 and whose 
product is maximal. 


A rectangular box with no top is to havea surface area 
of 16 m2. Find the dimensions that maximize its volume. 


Design a cylindrical can (with a lid) to contain 1 liter 
(= 1000 cm3) of water, using the minimum amount of 
metal. 


Show that solutions of equations (4) and (5) are in 
one-to-one correspondence with the critical points of 


A(X1, css Xir lr css: Ak) 
= f (X1, ..., Xn) — àilg1( X1, ---, Xn) — C1] 
— -e — AKLQk(X1, Xn) — Ck]. 


Find the absolute maximum and minimum for the 
function f(x, y, z) =x + y — z on the ball 
B = {(x, y, Z) | X? + yY? +2? < 1}. 


Repeat Exercise 23 for f(x, y, z) = X + yz. 


A rectangular mirror with area A square feet is to have 
trim along the edges. If the trim along the horizontal 
edges costs p cents per foot and that for the vertical 
edges costs q cents per foot, find the dimensions that 
will minimize the total cost. 


An irrigation canal in Arizona has concrete sides and 
bottom with trapezoidal cross section of area 

A = y(x + y tan 8) and wetted perimeter 

P =x + 2y/cosé, where x = bottom width, y = water 
depth, and 6 = side inclination, measured from vertical. 


3L 


The best design for a fixed inclination 6 is found by 
solving P = minimum subject to the condition 
A = constant. Show that y? = (Acos@)/(2 — sin 8). 


. Apply the second-derivative test to study the nature of 


the extrema in Exercises 3 and 7. 


A light ray travels from point A to point B crossing a 
boundary between two media (see Figure 3.4.7). In the 
first medium its speed is vı, and in the second it is v2. 
Show that the trip is made in minimum time when 
Snell's law holds: 


sing, v 


siné2 v2" 


A parcel delivery service requires that the dimensions of 
a rectangular box be such that the length plus twice the 
width plus twice the height be no more than 108 inches 
(I + 2w+ 2h < 108). What is the volume of the 
largest-volume box the company will deliver? 


l 
l 
l 
l 
l 
I 
l 
figure 3.4.7 Snell's law of refraction. 


Let P be a point on a surface S in R? defined by the 
equation f(x, y, z) = 1, where f is of class C2. 
Suppose that P is a point where the distance from the 
origin to S is maximized. Show that the vector 
emanating from the origin and ending at P is 
perpendicular to S. 


Let A be a nonzero symmetric 3 x 3 matrix. Thus, its 
entries satisfy aij = aji. Consider the function 
f(x) = 5(Ax) x 


(a) Whatis Vf? 


(b) Consider the restriction of f to the unit sphere 
S = {(x, y, Z) | X? +y? +z? = 1} in R?. By 
Theorem 7 we know that f must have a maximum 


and a minimum on S. Show that there must be an 
xe Sanda à # 0 such that Ax = Ax (The vector x 
is called an eigenvector, while the scalar A is called 
an eigenvalue.) 


(c) What are the maxima and minima for f on 
B ={(x, y, Z) | X? +y? +z? <1}? 


32. Suppose that A in the function f defined in Exercise 31 


is not necessarily symmetric. 
(a) Whatis Vf? 


(b) Can we conclude the existence of an eigenvector 
and eigenvalues as in Exercise 31? 
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36. A company’s production function is Q(x, y) = xy. The 


37. 


cost of production is C (x, y) = 2x + 3y. If this 
company can spend C (x, y) = 10, what is the maximum 
quantity that can be produced? 


Find the point on the curve (cost, sint, sin(t/2)) that is 
farthest from the origin. 


38. A firm uses wool and cotton fiber to produce cloth. The 


amount of cloth produced is given by Q(x, y) = 

Xy — x — y +1, where x is the number of pounds of 
wool, y the number of pounds of cotton, x > 1, and 

y > 1. If wool costs p dollars per pound, cotton costs q 


dollars per pound, and the firm can spend B dollars on 
material, what should the ratio of cotton and wool be to 
produce the most cloth? 


33. (a) Find the critical points of x + y2, subject to the 
constraint 2x2 + y? = 1. 


(b) Use the bordered Hessian to classify the critical 


points. 39. Carry out the analysis of Example 10 for the production 


function Q(K,L) =AK“%L!-®, where A and @ are 
positive constants and 0 < œ < 1. This is called a 
Cobb-Douglas production function and is sometimes 
used as a simple model for the national economy. 

Q is then the aggregate output of the economy for a 
given input of capital and labor. 


34. Answer the question posed in the last line of Example 9. 


35. Try to find the extrema of xy + yz among points 
satisfying xz = 1. 


3.5 The Implicit Function Theorem (Optional) 


In this section we state two versions of the implicit function theorem, arguably the most 
important theorem in all of mathematical analysis. The entire theoretical basis of the 
idea of a surface as well as the method of Lagrange multipliers depends on it. M oreover, 
it is a cornerstone of several fields of mathematics, such as differential topology and 
geometry. 


The One-Variable Implicit Function Theorem 


In one-variable calculus we learn the importance of the inversion process. For example, 
x = Iny is the inverse of y = e*%, and x = sin‘ y is the inverse of y = sinx. The 
inversion process is also important for functions of several variables; for example, the 
switch between Cartesian and polar coordinates in the plane involves inverting two 
functions of two variables. 

Recall from one-variable calculus that if y = f(x) isaC+ function and f’(x9) 40, 
then locally near Xo we can solve for x to give the inverse function: x = f-+(y). We 
learn that ( f -+)’( y) = 1/f’(x); that is, dx/dy = 1/(dy/dx). That y = f(x) can be 
inverted is plausible because f’(X9) 4 0 means that the slope of y = f(x) is nonzero, 
so that the graph is rising or falling near xo. Thus, if we reflect the graph across the line 
y =x, itis still a graph near (Xo, Yo), where yo = f (Xo). For example, in Figure 3.5.1, 
we can invert y = f(x) in the shaded box, so in this range, x = f(y) is defined. 


A Special Result 


We next turn to the situation for real-valued functions of variables x1, ..., Xn and z. 
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figure 3.5.1 If f’(xo) 4 0, then o , 
y= f% is locally invertible. y =f(x) is invertible 


EPON E near (Xo, Yo) 


Theorem 11 Speciallmplicit Function Theorem Supposethat F : R"+! > 
R has continuous partial derivatives. Denoting points in R"+! by (x z), where 
x eR" and z € R, assume that (X, Zo) satisfies 


oF 
F(X, Zo) =0 and oy (mr 20) #0. 


Then there is a ball U containing % in R" and a neighborhood V of Zp in R such 
that there is a unique function z = g(x) defined for xinU and zin V that satisfies 


F(x, g(x) = 0. 


Moreover, if xin U and z in V satisfy F(x z) = 0, then z = g(x). Finally, 
z = g(x) is continuously differentiable, with the derivative given by 


where D,F denotes the (partial) derivative of F with respect to the variable x— that 
is, we have DF = [0F /0Xj,..., OF /ƏXn]; in other words, 


3g _ _ OF /3Xi 


= : je ane iL 
aXi aF /az i i a 


A proof of this theorem is given in the Internet supplement. 

Onceitisknown thatz = g(x) exists andis differentiable, formula(1) may be checked 
by implicit differentiation; to see this, note that the chain rule applied to F (x, g(x) = 0 
gives 


DyF (x g(x) + Ea a(x) | [Dat =ü 


which is equivalent to formula (1). 


example | 
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In the special implicit function theorem, it is important to recognize the necessity of 
taking sufficiently small neighborhoods U and V . For example, consider the equation 


xX 4+77-1=0; 


that is, F (x, z) = x?+z*—1,withn = 1. Here (aF /az)(x, z) = 2z, and so the special 
implicit function theorem applies to a point (Xo, Zo), satisfying xê +z} — 1 = 0 and 
Zo Æ 0. Thus, near such points, zis auniquefunction of x. Thisfunctionisz = v1 — x2 
if Zo > 0 and z = —./1 — x? if Zo < 0. Note that z is defined for |x| < 1 only (U must 
not be too big) and z is unique only if it is near zọ (V must not be too big). These 
facts and the nonexistence of dz/dx at zo = 0 are, of course, clear from the fact that 
x? 4 2? = 1 defines a circle in the xz plane (Figure 3.5.2). 


figure 3.5.2 It is necessary to take small neighborhoods in 
the implicit function theorem. A 


The Implicit Function Theorem and Surfaces 


Let us apply Theorem 11 to the study of surfaces. We are concerned with the level 
set of a function g: U c R” — R; that is, with the surface S consisting of the set of 
x satisfying g(x) = Co, where Co = g(X) and where x is given. Let us taken = 3 
for concreteness. Thus, we are dealing with the level surface of a function g(x, y, z) 
through a given point (Xo, Yo, Zo). As in the Lagrange multiplier theorem, assume that 
Vg(Xo, Yo. Zo) # Q This means that at least one of the partial derivatives of g is nonzero. 
For definiteness, suppose that (dg /0Z)(Xo, Yo, Zo) # 0. By applying Theorem 11 to the 
function (x, y, Z) +> g(x, y, Z) — Co, we Know there is a unique function z = k(x, y) 
satisfying g(x, y, k(x, y)) = Co for (x, y) near (Xo, Yo) and z near zo. Thus, near Zp the 
surface S is the graph of the function k. Because k is continuously differentiable, this 
surface has a tangent plane at (Xo, Yo, Zo) given by 


ak ak 
Z=Zo+ Bao yd] — Xo) + Eto yo)|(y =w. (2) 
But by formula (1), 
ə 0g 
ak Fix, Yor Zo) 9k ay ™ Yor Zo) 
z (X00 Yo) = —oX and ay (Xo: Yo) = -y . 
zz” Yo, Zo) y gz” Yo, Zo) 
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figure 3.5.3 The construction of 
a path c(h) in the surface S 
whose tangent vector is v. 


example 2 


solution 


The path e(t) 


The line (Xo + t(x — Xo), Yo + tly — Yo)) 


Substituting these two equations into the equation for the tangent plane gives this equiv- 
alent description: 


ð 0 0 
0=(z- 20) — (Ko Yo, Zo) + (X — Xo) (Xo, Yo, Zo) + (Y — yo) 5, Yo, Zo); 
that is, 


(X — Xo, Y — Yo, Z — Zo) + VQ(Xo, Yo. Zo) = 0. 


Thus, the tangent plane to the level surface of g is the orthogonal complement to 
Vg(Xo, Yo, Zo) through the point (Xo, Yo, Zo). This agrees with our characterization of 
tangent planes to level sets from Chapter 2. 

We are now ready to complete the proof of the Lagrange multiplier theorem. To do 
this, we must show that every vector tangent to S at (Xo, Yo, Zo) is tangent to acurvein S. 
By Theorem 11, we need only show this for a graph of the form z = k(x, y). However, 
if v = (X — Xo, Y — Yo, Z — Zo) is tangent to the graph [that is, if it satisfies equation 
(2)], then vis tangent to the path in S given by 

Ct) = (Xo + t(X — Xo), Yo + tly — Yo), K(Xo + t(X — Xo), Yo +t(y — Yo))) 


at t = 0. This can be checked by using the chain rule. (See Figure 3.5.3.) 


Near what points may the surface 
x3 4 3y? + 8xz? — 3z°y =1 
be represented as a graph of a differentiable function z = k(x, y)? 


HerewetakeF (x, y, z) = x?-+3y?+8xz?—3z3y—1 and attemptto solve F (x, y, z) = 0 
for z as a function of (x, y). By Theorem 11, this may be done near a point (Xo, Yo, Zo) 
if (OF /dZ)(Xo, Yo, Zo) Æ 0, that is, if 


Zo(16Xo — 9Z0Yo) Æ 0, 
which means, in turn, 


Zo Æ 0 and 16Xọ Æ 9Zoyo. A 
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General Implicit Function Theorem 


Next we shall state, without proof, the general implicit function theorem.** Instead of 
attempting to solve one equation for one variable, we attempt to solve m equations for 
m variables Z1, ..., Zm: 


Fi(X1,..-)Xn,Z1,+--Zm) = 0 

Fo(X1,..-,Xn,Z1y---1Zm) =O 

f (3) 
Fm(Xi .. -1 Xnr Zir... Zm) = 0. 


In Theorem 11 we had the condition aF /az 4 0. The condition appropriate to the 
general implicit function theorem is that A + 0,15 where A is the determinant of the 
m x m matrix 


OFy OF, 
0Z1 dZm 
OF m OF m 
021 0Zm 


evaluated at the point (Xo, Z); in the neighborhood of such a point, we can uniquely 
solve for zin terms of x 


Theorem 12 General Implicit Function Theorem If A + 0, then near 
the point (Xo, Z), equation (3) defines unique (smooth) functions 


ZK (Xir Xn) (i =1,...,m). 


Their derivatives may be computed by implicit differentiation. 


14F or three different proofs of the general case, consult: 


(a) E. Goursat, A Course in Mathematical Analysis, |, Dover, New York, 1959, p. 45. (This proof 
derives the general theorem by successive application of Theorem 11.) 

(b) T. M. Apostol, Mathematical Analysis, 2d ed., Addison-Wesley, Reading, M ass., 1974. 

(c) J. E. Marsden and M . Hoffman, Elementary Classical Analysis, 2d ed., Freeman, New York, 1993. 


Of these sources, the last two use more sophisticated ideas that are usually not covered until a junior- 
level course in analysis. The first, however, is easily understood by the reader who has some knowledge 
of linear algebra. 


15For students who have had linear algebra: The condition A + 0 has a simple interpretation in the 
case that F is linear; namely, A + 0 is equivalent to the rank of F being equal to m, which in turn is 
equivalent to the fact that the solution space of F = 0 is m-dimensional. 
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example 3 | To show that near the point (x, y, u, v) = (1, 1, 1, 1), we can solve 


xu + yuu? = 2 
3 nw) 


uniquely for u and v as functions of x and y. Compute du/dx at the point (1, 1). 


solution | Tocheck solvability, we form the equations 


Fi(x, y, U, v) = Xu + yvu? — 2 
3 2342 


F2(X, yY, U, v) = XU? + y*v 
and the determinant 
OF, dF, 
au ðv 
OF2 dF? 
ðu ðv 
x+2yuv yu? 


Because A # 0, solvability is assured by the general implicit function theorem. To find 
du/dx, we implicitly differentiate the given equations in x using the chain rule: 


ee sy py eur 4 2ynu et <0 
ax vax yvu zx 7 
ðu ðv 
3x Paha 3 4 2 eee ; 
4 ax ae ax g 
Setting (x, y, u, v) = (1, 1, 1, 1) gives 
ðu ðv 
— +— = -1 
ax Dx 
ou dv 
3— +4— = _1 
ax T ox 


Solving for du/ax by multiplying the first equation by 4 and subtracting gives du/ax = 
1 


-} 4 


Inverse Function Theorem 


A special case of the general implicit function theorem is the inverse function theorem. 
Here we attempt to solve the n equations 


fi(X1, eee Xn) = yı 


(4) 


Orase Xa) =Yn 


for X1, ..., Xn as functions of yı, ..., Yn; that is, we are trying to invert the equa- 
tions of system (4). This is analogous to forming the inverses of functions like sin 
x = y and e* = y, with which you should be familiar from elementary calculus. 
Now, however, we are concerned with functions of several variables. The question of 


example 4 


solution 
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solvability is answered by the general implicit function theorem applied to the func- 
tions yi — f,(X1,..., Xn) with the unknowns X1, ..., Xn (called Z1, ..., Zn earlier). The 
condition for solvability in a neighborhood of a point X is A + 0, where A is the 
determinant of the matrix Df (x), and f =(fi,..., fn). The quantity A is denoted by 


O( fi, ..., fn)/3(Xir -- -1 Xn), OF A( Ya, --- Yn) /O(X1, --- Xn) Or J ( f )(%) and is called 
the J acobian determinant of f. Explicitly, 
LY 
a( f f ) 0X1 OXn 
S| SJ (F)(m) =]: : (5) 
a(X1, Xn) X=X 
ay ae Btn | ) 


Note that in the case when f is linear— for example f(x) = Ax, where A isann xn 
matrix— the condition A + 0 is equivalent to the fact that the determinant of A, det 
A +40, and from Section 1.5 we know that A, and therefore f, has an inverse. 

The] acobian determinant will play an important role in our work on integration (see 
Chapter 5). The following theorem summarizes this discussion: 


Theorem 13 Inverse Function Theorem Let U c R” be open and let 
f: U + R,..., fri U — R have continuous partial derivatives. Consider equa- 
tions (4) near a given solution Xo, yo. If J ( f)(%o) [defined by equation (5)] is 
nonzero, then equation (4) can be solved uniquely as x = g(y) for xnear X and 
ynear yo. M oreover, the function g has continuous partial derivatives. 


Consider the equations 


=U, sin X + COSY = v. 
Near which points (x, y), can we solve for x, y in terms of u, v? 


Here the functions are u = f,(x, y) = (x4 + y*)/x and v = f(x, y) = sinx + 
cosy. We want to know the points near which we can solve for x, y as functions 
of u and v. According to the inverse function theorem, we must first compute the 
Jacobian determinant a( fı, f2)/9(x, y). We take the domain of f = (fı, f2) to be 
U ={(x, y) € R? | x 40}. Now 


afi of 3x4 — y4 4y3 
(fi, fz) | ax dy | __ |—a = | sny a aay 4y? 
BG) Jaf af R. 3x") x COSX. 
əx ay cosx =siny 


Therefore, at points where this does not vanish we can solve for x, y in terms of u 
and v. In other words, we can solve for x, y near those x, y for which x + 0 and 
(sin y)( y* — 3x4) + 4xy? cosx. Such conditions generally cannot be solved explicitly. 
For example, if Xo = 2/2, Yo = 2/2, we can solve for x, y near (Xo, Yo) because there, 
Əl fi, f2)/3(x, yY) #0. A 
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exercises 


L Show that the equation x + y — z + cos(xyz) = 0 canbe 
solved for z = g(x, y) near the origin. Find Jx and ay 
at (0, 0). 


2 Show that xy + z + 3xz° = 4 is solvable for z asa 
function of (x, y) near (1, 0, 1). Compute az/ax and 
dz/ay at (1, 0). 


3. (a) Check directly (i.e., without using Theorem 11) 
where we can solve the equation 
F(x, y) = yY? +y + 3x +1 = 0 for y in terms of x. 


(b) Check that your answer in part (a) agrees with the 
answer you expect from the implicit function 
theorem. Compute dy/dx. 


4. Repeat Exercise 3 with 
F(x, y) =xy2 —2y +x24+2=0. 


5. Let F (x, y) = 0 define a curve in the xy plane through 
the point (xo, yo), where F is C 1. Assume that (dF /ay) 
(Xo, Yo) # 0. Show that this curve can be locally 
represented by the graph of a function y = g(x). Show 
that (i) the line orthogonal to VF (Xo, Yo) agrees with 
(ii) the tangent line to the graph of y = g(x). 


6. Consider the surface S given by 3y2z2 — 3x = 0. 


(a) Using the implicit function theorem, verify that we 
can solve for x as a function of y and z near any 
point on S. Explicitly write x as a function of y 
and z. 

Show that near (1, 1, —1) we can solve for either y 
or z, and give explicit expressions for these 
variables in terms of the other two. 


(b 


7. Show that x3z2 — z3yx = 0 is solvable for z as a 
function of (x, y) near (1, 1, 1), but not near the origin. 
Compute əz/əx and az/dy at (1, 1). 


8. Discuss the solvability in the system 
3x +2y+7+u+4+v2 =0 
4x + 3y+z+u2+u+w+2=0 
X+z+w+u2+2=0 


foru, v, w in terms of x, y, z near 
X=y=z=0,u=v=0,andw =-2. 


9. Discuss the solvability of 


y+x+uv=0 
uxy+v=0 


for u, vin terms of x, y near x = y = u = v = Q and 
check directly. 


Investigate whether or not the system 


u(x, y,Z) = X +Xyz 
v(x, y, z) = y+xy 
w(x, y, Z) = Z + 2x + 3z? 


can be solved for x, y, z in terms of u, v, w near 
(x, y, z) = (0, 0, 0). 


Consider f(x, y) = ((x2 — y2)/(x2 + y?), 
xy/(x2 + y?)). Does this map of R2\(0, 0) to R2 have a 
local inverse near (x, y) = (0, 1)? 


(a) Define x: R? —> R by x(r, 8) = r cos@ and define 
y: R? —> R by y(r, 6) =r sing. Show that 


a(x, y) 
a(r, 0) (ro 80) 


=f0. 


(b) When can we form a smooth inverse function 
(r(x, y), A(x, y))? Check directly and with the 
inverse function theorem. 


(c) Consider the following transformations for 
spherical coordinates (see Section 1.4): 


X(p, ¢, 6) = psing cose 
X(p, ¢, 6) = psingsing 
2(p, ¢, 9) = pCcos¢. 


Show that the Jacobian determinant is given by 


A(X, y, Z) 


2 . 
———_ = p sing. 
eee es 
(d) When can we solve for (o, #, 6) in terms of 

(x, y, z)? 


Let (Xo, Yo, Zo) be a point of the locus defined by 
z? +xy —a = 0, z? + x? — y? — b = 0, where a and b 
are constants. 


(a) Under what conditions may the part of the locus 
near (Xo, Yo, Zo) be represented in the form 
x = f(z), y = g(z)? 

(b) Compute f’(z) and g'(2). 


Consider the unit sphere S given by x? + y? +z? = 1. 
S intersects the x axis at two points. Which variables can 
we solve for at these points? W hat about the points of 
intersection of S with the y and z axes? 


15. Let F (x, y) = x? — y? and let C denote the level curve 
given by F (x, y) =0. 


(a) Without using the implicit function theorem, show 
that we can describe C as the graph of x asa 
function of y near any point. 


(b) Show that Fx(0, 0) = 0. Does this contradict the 
implicit function theorem? 


16. Consider the system of equations 


xv? + 2y3u = 3 
3yu — xuv? = 2. 


Show that near the point (x, y, u, v) = (1, 1, 1, 1), this 
system defines u and v implicitly as functions of x and 
y. For such local functions u and v, define the local 
function f by f(x, y) = (u(x, y), v(x, y)). Find 
Df(1,1). 


17. Consider the equations 


x? — y? -u +v 4 =O 
2xy + y? — 2u? + 304 +8 = 0. 
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(a) Show that these equations determine functions 
u(x, y) and v(x, y) near the point 
(x, y,U, v) = (2, —1, 2, 1). 


(b) Compute %¥ at (x, y) = (2, —1). 


18. Isit possible to solve the system of equations 


xy? +XzZU + yu? = 3 
u2yz + 2xv — uĉv? = 2 
for u(x, y, Z), v(x, y, z) near 
(x, y,Z) = (1, 1, 1), (u, v) = (1, 1)? Compute av/ay at 
(X yi = (11) 


19. The problem of factoring a polynomial 
X" 4 a,_yx"-1 + ... + ag into linear factors is, in a 
sense, an “inverse function” problem. The coefficients a; 
may be thought of as functions of the n roots r j. We 
would like to find the roots as functions of the 
coefficients in some region. With n = 3, apply the 
inverse function theorem to this problem and state what 
it tells you about the possibility of doing this. 


L Let f be any differentiable function. Show that 
u = f(y —kx) isa solution to the partial differential 


. Ou 
equation 7X T =0. 


2 Provethatif u and v have continuous mixed second 
partial derivatives and satisfy the Cauchy-Riemann 


equations 
au _ dv 
əx ay 
ðu _ ðv 
ay ax’ 


then both u and v are harmonic. 


3. Let f(x, y) =x? — y? — xy +5. Find all critical points 
of f and determine whether they are local minima, local 
maxima, or saddle points. 


4. Find the absolute minimum and maximum values of the 
function f(x, y) = x? + 3xy + y? +5 on the unit disc 
D = {(x, y) | x? + y? < 1}. 


5. Find the second-order Taylor polynomial for 
f(x, y) = ye-*’ at (1, 1). 


6. Let f(x, y) = ax? + bxy + cy?. 
(a) Find g(x, y), the second-order Taylor approximation 
to f at (0, 0). 
(b) What is the relationship between g and f? 


(c) Prove that R2(%, h) = 0 for all x, he R2. (HINT: 
Show that f is equal to its second-order Taylor 
approximation at every point.) 


7. Analyze the behavior of the following functions at the 
indicated points. [Your answer in part (b) may depend on 
the constant C .] 

(a) z =x? — y? + 3xy, 
(b) z =x? — y? +Cxy, 


(x, y) =z (0, 0) 
(x, y) = (0, 0) 


8. Find and classify the extreme values (if any) of the 
functions on R? defined by the following expressions: 


(a) y? -— x3 
(b) (x — 1)? + (x — y)? 
(c) x2 + xy? + y4 


9. (a) Find the minimum distance from the origin in R? to 


the surface z = vx? — 1. 


(b) Repeat part (a) for the surface z = 6xy + 7. 
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10. Find the first few terms in the Taylor expansion of 13. Find and describe the critical points of 
f(x, y) = e*Y cosx about x = 0, y = 0. f(x, y) = y sin (xx) . (See Figure 3.R.3.) 


IL Prove that 


3x4 — 4x3 — 12x? + 18 


7= “A(T + ay?) 


has one local maximum, one local minimum, and one 
saddle point. (The graph is shown in Figure 3.R.1.) 


aN 


figure 3.R.3 Graph of z= ysin (x). 


14. A graph of the function z = sin(zx)/(1+ y?) is shown 
in Figure 3.R.4. Verify that this function has alternating 
maxima and minima on the x axis, with no other critical 
points. 


figure 3.R.1 Graph of 
z= (8x4 — 4x8 — 12x? + 18)/1201 + 4). 


12. Find the maxima, minima, and saddles of the function 
z = (2 +co0Sx)(sinzy), which is graphed in 
Figure 3.R.2. 


SKÉ JO 
SAW 
HAWS 


figure 3.R.4 Graph of z = sin(rx)/(1+ y). 


figure 3.R.2 Graph of z = (2+ cos r xX) (sin x y). 


In Exercises 15 to 20, find the extrema of the given functions subject to the given constraints. 


15. f(x, y) = x? — 2xy + 2y?, subject to x? + y? = 1 20. z = cos? x + cos? y, subject to the condition 
X+y=x/4 

16. f(x, y) = xy — y’, subject to x? + y? = 1 

wee My a 2L Find the points on the surface z? — xy = 1 nearest to the 


17. f(x,y) = cos(x? — y2), subject to x? + y? = 1 origin. 
Poy 22. Use the implicit function theorem to compute dy/dx for 
18. ee ae a (a) x/y =10 


(b) x3-—siny+y4=4 
19. z = xy, subject to the condition x + y = 1 (c) e + y3 =0 


23. 


3L 


Find the shortest distance from the point (0, b) to the 
parabola x? — 4y = 0. Solve this problem using the 
Lagrange multiplier method and also without using 
Lagrange’s method. 


Determine all values of k for which the function 
g(x, y, Z) = X2 +kxy + kxz + ky? + kz? has a local 
minimum at (0, 0, 0). 


Find and classify all critical points of the function 
g(x, y) = 4x4 — 3x3 + y3 + 3x? — 3y? + 20. 


Solve the following geometric problems by Lagrange’s 
method. 


(a) Find the shortest distance from the point (a1, a2, a3) 
in R? to the plane whose equation is given by 

bixi + b2x2 + b3x3 + bo = 0, where 

(b1, b2, b3) # (0, 0, 0). 

Find the point on the line of intersection of the two 
planes aX + a2X2 + a3X3 = 0 and 

b1X1 + b2X2 + b3x3 + bo = 0 that is nearest to the 
origin. 

Show that the volume of the largest rectangular 
parallelepiped that can be inscribed in the ellipsoid 


(b 


~ 
Ce! 


is 8abc/3V3. 


a A particle moves in a potential V (x, y) = 


x3 — y? +x? + 3xy. Determine whether (0, 0) is a stable 
equilibrium point; that is, whether or not (0, 0) is a strict 
local minimum of V . 


Study the nature of the function f (x, y) = x? — 3xy? 
near (0, 0). Show that the point (0, 0) is a degenerate 
critical point; that is, D = 0. This surface is called a 
monkey saddle. 


Find the maximum of f(x, y) = xy on the curve 
(x4+1)24y? =1. 


Find the maximum and minimum of 
f(x, y) =xy —y +x — onthe set x? + y? < 2. 


The Baraboo, Wisconsin, plant of International Widget 
Co., Inc., uses aluminium, iron, and magnesium to 
produce high-quality widgets. The quantity of widgets 
that may be produced using x tons of aluminum, y tons 
of iron, and z tons of magnesium is Q (x, y, z) = xyz. 
The cost of raw materials is aluminum, $6 per ton; iron, 
$4 per ton; and magnesium, $8 per ton. How many tons 
each of aluminum, iron, and magnesium should be used 
to manufacture 1000 widgets at the lowest possible cost? 
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(HINT: Find an extreme value for what function subject 
to what constraint.) 


Let f: R > R be of class C t and let 


u = f(x) 
v = —y + xf (x). 


If f’(xo) + 0, show that this transformation of R? to R2 
is invertible near (Xo, yo) and its inverse is given by 


flu) 
—v +uf-(u), 


X 


Show that the pair of equations 


x? — y? -u+ +40 
2xy + y? — 2u? +304 +8 = 0 


determine functions u(x, y) and v(x, y) defined for 
(x, y) near x = 2 and y = —1 such that u(2, —1) = 2 
and v(2, —1) = 1. Compute du/ax at (2, —1). 


Show that there are positive numbers p and q and unique 
functions u and v from the interval (—1 — p, —1 + p) 
into the interval (1 — q, 1 + q) satisfying 


xet) 4 u(x)er™) = 0 = xet 4 v(x)! 


for all x in the interval (—1 — p, —1 + p) with 
u(—1) = 1 = v(-]). 


35. To work this exercise, you should be familiar with 


the technique of diagonalizing a 2 x 2 matrix. Let 

a(x), b(x), and c(x) be three continuous functions 
defined on U U aU, where U is an open set and 3U 
denotes its set of boundary points (see Section 2.2). Use 
the notation of Lemma 2 in Section 3.3, and assume that 
for each x e U U ðU the quadratic form defined by the 
matrix 


is positive-definite. For aC 2 function v onU U aU, we 
define a differential operator L by 


Lv =a(a2v/ax2) + 2b(a2v/axay) + c(3?v/əy?). 


With this positive-definite condition, such an operator is 
said to be elliptic. A function v is said to be strictly 
subharmonic relative to L if Lv > 0. Show that a 
strictly subharmonic function cannot have a maximum 
pointinU. 
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36. A function v is said to be in the kernel of the operator L 
described in Exercise 35 if Lv = 0 on U U 3U . Arguing 
as in Exercise 47 of Section 3.3, show that if v achieves 
its maximum on U , it also achieves it on dU . This is 
called the weak maximum principle for elliptic 
operators. 


37. Let L bean elliptic differential operator as in Exercises 
35 and 36. 
(a) Define the notion of a strict superharmonic function. 
(b) Show that such functions cannot achieve a minimum 
onU. 
If v is as in Exercise 36, show that if v achieves its 
minimum on U , it also achieves it on aU . 


(c 


38. Consider the surface S given by 
x?z +xsiny + ye?! = 1. 


(a) Find the equation of the tangent plane to S at the 


point (1, 0, 1). 

Is it possible to solve the equation defining S for the 
variable y as a function of the variables x and z near 
(1, 0, 1)? Why? 

Find ¥ at (1, 0, 1) 


(b 


(c 
39. Consider the system of equations 


2xu3v—yv=1 
3 Die 
you + x°ur = 2 


Show that near the point (x, y, u, v) = (1, 1, 1, 1), this 
system defines u and v implicitly as functions of x and y. 
For such local functions u and v, define the local function 
f by f(x, y) = (u(x, y), v(x, y)). Find D f (1, 1). 


The following method of least squares should be applied to 
Exercises 40 to 45. 

It sometimes happens that the theory behind an 
experiment indicates that the experimental data should lie 
approximately along a straight line of the form y = mx + b. 
The actual results, of course, never match the theory exactly. 
We are then faced with the problem of finding the straight line 
that best fits some set of experimental data (x1, y1), 

(Xn, Yn) as in Figure 3.R.5. If we guess at a straight line 
y = mx + b to fit the data, each point will deviate vertically 
from the line by an amount dj = yi — (mxi + b). 

We would like to choose m and b in such a way as to 
make the total effect of these deviations as small as possible. 
However, because some are negative and some positive, we 
could get a lot of cancellations and still have a pretty bad fit. 
This leads us to suspect that a better measure of the total error 
might be the sum of the squares of these deviations. Thus, we 
are led to the problem of finding the m and b that minimize 


the function 
n 
s= f(m, b) =df+d}+---+dh = X. (yi -mxi —b)?, 
i=l 


where x1, ..., Xn and y1,..., Yn are the given data. 


(x3, Y3) 


figure 3.R.5 The method of least squares tries to find a 
straight line that best approximates a set of data. 


4O. For each set of three data points, plot the points, write 
down the function f (m, b) from the preceding equation, 
find m and b to give the best straight-line fit according to 
the method of least squares, and plot the straight line. 


(a) (xı, Y1) = (1, 1) 
(X2, Y2) = (2, 3) 
(x3, y3) = (4, 3) 


(b) (x1, y1) = (0, 0) 
(x2, Y2) = (1, 2) 
(x3, y3) = (2, 3) 
41. Show that if only two data points (x1, y1) and (x2, y2) 
are given, this method produces the line through (x1, y1) 
and (x2, y2). 


42. Show that the equations for a critical point, as/ab = 0 
and əs/əm = 0, are equivalent to 


(Za) (5) 


and 


n(x?) +(x) = (Ean): 
where all the sums run from i = 1 toi =n. 


43. If y = mx +b is the best-fitting straight line to the data 
points (x1, Y1), .--, (Xn, Yn) according to the 


least-square method, show that 


n 
$ (yi -mx — b) =0; 
i=l 


that is, the positive and negative deviations cancel (see 
Exercise 42). 


44, Use the second-derivative test to show that the critical 
point of f isa minimum. 


45. Use the method of least squares to find the straight line 
that best fits the points (0, 1), (1, 3), (2, 2), (3, 4), and 
(4, 5). Plot the points and line.16 


46. The partial differential equation 


where c is a constant, comes up in the study of 
deflections of a thin beam. Show that 


u(x, t) = sin(Amx) cos(a2z2ct) 


is a solution for any choice of the parameter A. 
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47. The K ortweg-DeV ries equation 


arises in modeling shallow water waves (called solitons). 
Show that 


u(x,t) = 12a2sech2(ax — 4a3t) 


is a solution to the K ortweg-D eV ries equation (see the 
Internet supplement). 


48. The heat-conduction equation in two space dimensions is 
k(Uxx + Uyy) = Ut. 


Assuming that u(x, y, t) = X (x)Y (y)T (t), find ordinary 
differential equations satisfied by X (x), Y (y), and T (t). 


49. The heat conduction equation in two space dimensions 
may be expressed in terms of polar coordinates as 


k 1 1 
(ur + ol + =U00 J = Ut. 


Assuming that u(r, 6, t) = R(r)@(6)T (t), find ordinary 
differential equations satisfied by R(r), @(6), and T (t). 


16T he method of least squares may be varied and generalized in a number of ways. The basic idea can be applied to equations of more 
complicated curves than the straight line. For example, this might be done to find the parabola that best fits a given set of data points. These 
ideas also formed part of the basis for the development of the science of cybernetics by Norbert Wiener. A nother version of the data is the 
following problem of least-square approximation: Given a function f defined and integrable on an interval [a, b], find a polynomial P of 


b 
J | f(x) — P(x) |? dx 


degree <n such that the mean square error 


iS as small as possible. 
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Vector-Valued Functions 


... who by vigor of mind almost divine, the motions and figures of the planets, the paths 


of comets, and the tides of the seas first demonstrated. —Newton’s Epitaph 


Chapters 2 and 3 focused on real-valued functions. This chapter is 
largely concerned with vector-valued functions. We begin in the first 
section of this chapter with a continuation of our study of paths, adding 
applications of Newton's second law. Then we study arc length of paths. 
Following this, we introduce the divergence and curl of a vector field 
which, in addition to the gradient, are basic operations in vector dif- 
ferential calculus. Then the geometry and calculus of the divergence 
and curl are studied. The associated integral calculus will be given in 
Chapter 8. 


4.1 Acceleration and Newton's Second Law 


In Section 2.4 we studied the basic geometry of paths, learning how to sketch curves 
(the images of paths) and compute tangent lines. We also learned to think of, as the name 
suggests, a path as the trajectory of a particle and to regard the derivative of the path as 
its velocity vector. In this section we continue our study of paths, including additional 
topics, especially acceleration and Newton's second law. 


Differentiation of Paths 


Recall that a path in R” is a map cof R or an interval in R to R". If the path is differ- 
entiable, its derivative at each timet isan n x 1 matrix. Specifically, if xz(t), ..., Xn(t) 
are the component functions of c the derivative matrix is 


dxı/dt 
dx2/dt 


c(t) = 


dx,/dt 
which can also be written in vector form as 
(dxı/dt, ..., dX, /dt) oras = (X;(t), ..., xf(t)). 
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example | 


solution 


example 2 


Recall from Section 2.4 that c(t) is the tangent vector to the path at the point dt). 
Also recall that if € represents the path of a moving particle, then its velocity vector is 


v= C(t), 


and its speed iss = ||v]|. 
The differentiation of paths is facilitated by the following rules. 


Let b(t) and dt) be differentiable paths in R? and p(t) 
and q(t) be differentiable scalar functions: 


Sum Rule: qth + dt)] = b(t) + c(t) 
Scalar Multiplication Rule: £ ipat] = p'(t)dt) + p(t)c(t) 
Dot Product Rule: S Tit) -dt)] = b(t) - dt) + b(t) - c(t) 
Cross P roduct Rule : S Tit) x dt)] = b(t) x dt) + b(t) x c(t) 


Chain Rule: f Idal) = q'(t)c(q(t)) 


These rules follow by applying the usual differentiation rules to the components. 


Show that if qt) is a vector function such that ||¢{t) || is constant, then ¢(t) is perpen- 
dicular to dt) for all t. 


Because ||qt)|| is constant, so is its square ||qt) ||? = dt) - qt). The derivative of this 
constant is zero, so by the dot product rule, 


“lat -dt)] = e(t)-dt) + dt) - c(t) = 2dt) - c(t); 


thus, dt) - c(t) = 0; that is, c(t) is perpendicular to qt). A 


0= 


For a path describing uniform rectilinear motion, the velocity vector is constant. In 
general, the velocity vector is a vector function w = c(t) that depends on t. The 
derivative a= dv/dt = c’(t) is called the acceleration of the curve. If the curve is 
(x(t), y(t), z(t)), then the acceleration at time t is given by 


alt) = x"(t)i+ y"(t)j + z”(t)k 


A particle moves in such a way that its acceleration is constantly equal to —K. If the 
position when t = 0 is (0, 0, 1) and the velocity att = 0 is i+ j, when and where 
does the particle fall below the plane z = 0? Describe the path traveled by the particle 
(assume t > 0). 


solution 


figure 4.1.1 The path of the 
particle with initial position 

(0, 0, 1), initial velocity i + j, and 
constant acceleration —k 

is a parabola in the plane y= x. 


example 3 


solution 
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Let dt) = (x(t), y(t), z(t)) = x(t)i+ y(t)j + z(t)k be the path traced out by the 
particle, so that the velocity vector is c(t) = x’(t)i+ y’(t)j + 2’(t)k The acceleration 
c’(t) is —k, so x”(t) = 0, y”(t) = 0, and z”(t) = —1. It follows that x’(t) and y’(t) are 
constant functions, and z’(t) is a linear function with slope —1. Because ¢(0) = i+ jJ, 
we get c(t) = i+ j — tk Integrating again and using the initial position (0, 0, 1), we 
find that (x(t), y(t), z(t)) = (t, t, 1 — 5t?). The particle drops below the plane z = 0 
when 1 — 5t? = 0; that is, t = /2 (because t > 0). At that instant, the position is 
(/2, /2, 0). The path traveled by the particle is a parabola in the plane y = x (see 
Figure 4.1.1), because in this plane the equation is described by z = 1 — 5x?, A 


The image of a C ! path is not necessarily “very smooth”; indeed, it may have sharp 
bends or changes of direction. For instance, the cycloid qt) = (t — sint, 1 — cost) 
shown in Figure 2.4.6 has cusps at all points where c touches the x axis (that is, when 
1 — cost = 0, which happens when t = 2xn, n = 0, +1, ...). Another example is the 
hypocycloid of four cusps, € [0,27] —> R?, t + (cos3t, sin? t), which has cusps at 
four points (Figure 4.1.2). At all such points, however, c(t) = O and the tangent line is 
not well defined. Evidently, the direction of c(t) may change abruptly at points where 
it slows to rest. 

A differentiable path cis said to be regular att = to if € (to) 4 O If c(t) = Ofor all 
t, we say that c is a regular path. In this case, the image curve looks smooth. 


A particle moves along a hypocycloid according to the equations 
x=cost, y=sint, a<t<b. 
W hat are the velocity and speed of the particle? 


The velocity vector of the particle is 


v= iy oj = —(3 sint cos? t)i + (3 cost sin? t)j, 


and its speed is 


s = ||vi] = (9 sin? t cos*t +9 cos?t sin*t)/? = 3 |sint| cost]. a 


Image of e 


c(t) figure 4.1.2 The image of the smooth path 
c(h = (cos? tsin? A, a hypocycloid, does 
x not “look smooth.” 
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figure 4.1.3 A mass M attracts a 
mass mwith a force F given by 
Newton's law of gravitation: 

F = —GmMt/r°. 


Newton's Second Law 


If a particle of mass m moves in R3, the force F acting on it at the point dt) is related 
to the acceleration a(t) by Newton’s second law:+ 


F(dt)) = mat). 


In particular, if no forces act on a particle, then a(t) = O so c(t) is constant and the 
particle follows a straight line. 


Acceleration and Newton’s Second Law The acceleration of a path 


ct) is 
If Fis the force acting and m is the mass of the particle, then 


F=ma 


In the problem of determining the path qt) of a particle under the influence of a given 
force field, F, Newton's law becomes a differential equation (i.e., an equation involving 
derivatives) for dt). 

For example, the motion of a planet moving along a path r(t) around the sun (con- 
sidered to be located at the origin in IR?) obeys the law 


_GmM 


E 
mr’ = ah 


where M is the mass of the sun, m that of the planet, r = |r|], and G is the gravita- 
tional constant. The relation used in determining the force, F = —G mM r/r3, is called 
Newton’s law of gravitation (see Figure 4.1.3). We shall not make a general study 
of such equations in this book, but content ourselves with the special case of cir- 
cular orbits. (M ore general orbits— the conic sections— are discussed in the Internet 
supplement.) 


Circular Orbits 


Consider a particle of mass m moving at constant speed s in a circular path of radius 
ro. Supposing that it moves in the xy plane, we can suppress the third component and 


write its location as 


AM ost scientists acknowledge that F = mais the single most important equation in all of science and 
engineering. 
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y 
A 


figure 4.1.4 The position, velocity, and acceleration 
of a particle in circular motion. 


Note that this is a circle of radius rọ and that its speed is given by |r’(t)|| = s. The 
quantity s/ro is called the frequency and is denoted w. Thus, 


r(t) = (ro cos wt, rosin at). 


The acceleration is given by 


were (fas fan) = Sao =v. 


Thus, the acceleration is in a direction opposite to r(t); thatis, it is directed toward the 
center of the circle (see Figure 4.1.4). This foes multiplied by the mass of the 
particle is called the centripetal force. Even though the speed is constant, the direction 
of the velocity is continuously changing and therefore the acceleration, which is arate 
of change in either speed or direction or both, is nonzero. 

Newton's law helps us discover a relationship between the radius of the orbit of a 
revolving body and the period; that is, the time it takes for one complete revolution. 
Consider a satellite of mass m moving with a speed s around a central body with mass 
M inacircular orbit of radius ro (distance from the center of the spherical central body). 
By Newton's second law F = ma we get 


s?m GmM 
z r(t) = z r(t) 
r rè 


The lengths of the vectors on both sides of this equation must be equal. Hence 
s? = on, 
ro 


If T denotes the period, then s = 2xro/T ; substituting this value for s in the preceding 
equation and solving for T , we obtain the following: 


Kepler’s Law 
w ow? 


Thus, the square of the period is proportional to the cube of the radius. 
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example 4 


solution 


Historical Note 


We have defined two basic concepts associated with a path: its velocity and its 
acceleration. Both involve differential calculus. The basic concept of the length of a 
path, which involves integral calculus, will be taken up in the next section. 


Suppose that a satellite is to be in a circular orbit about the earth such that it stays fixed 
in the sky over one point on the equator. W hat is the radius of such a geosynchronous 
orbit? (The mass of the earth is 5.98 x 10% kilograms and G = 6.67 x 107! in the 
meter-kilogram-second [kgs] system of units.) 


The period of the satellite should be 1 day, so T = 60 x 60 x 24 = 86,400 seconds. 
From the formula T ? = rg(2z)?/GM , we getr =T°GM /(2z)’, and so 


T2GM E (86,400)? x (6.67 x 1071!) x (5.98 x 1024) 
(2m)? (27)? 


Thus, ro = 4.23 x 107m = 42,300 km ~ 26,200 mi. A 


~ 7.54 x 107 mè. 


3— 
ro = 


Supplement to Section 4.1: Planetary Orbits, Hamilton's 
Principle, and Spacecraft Trajectories 


In this section we have been studying paths in space and N ewton’s second law. Hopefully, 
you realize that these ideas apply to the real world— the motion of our earth around the 
sun, for example, is governed by these laws. But there is more to the story, and we will 
try to convey some of it here. 


Kepler, Newton, Hamilton, Feynman, and Planck 


As we discussed in the historical introduction, the law of planetary motion stating 
that the square of the period is proportional to the cube of the radius of an orbit is 
one of the three that Kepler observed before Newton formulated his laws of 
motion, known more generally as Newton's mechanics. These mechanics 
enable us to compute the period of a satellite about the earth or a planet about 
the sun (when the radius of its orbit is given), and, as we will indicate shortly, 
trajectories of space missions. 

Kepler discovered and used results like this not only for circular orbits but more 
generally for elliptical orbits. Newton was able to derive Kepler's three celestial 
laws from his own law of gravitation. The neat mathematical order of the universe 
that these laws provided had a great impact on eighteenth-century thought. 

Newton never wrote down his laws of mechanics as differential equations. 

This was first done by Euler around 1730. Newton made most of his deductions (at 
least those in published form) by geometric methods. Euler also showed how 
Newton’s equations followed from Maupertuis’ action principle. The clearest 
version of the action principle in mechanics, now known as Hamilton's 
principle, is due to William Rowan Hamilton around 1830, who, as we all should 
now know, happens to also be the father of vector calculus. Hamilton’s version of 
Maupertuis’ principle was elegantly presented by Richard Feynman, as we 
discuss next. 


figure 4.1.5 Richard P Feynman 
(1918-1988). 


figure 4.1.6 Feynman lecturing 
at Caltech. 
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FEYNMAN AND HAMILTON’S PRINCIPLE. In his legendary Caltech Lectures on 
Physics, Nobel Prize-winning physicist Richard Phillips Feynman (see Figures 4.1.5 
and 4.1.6) included what he called a “Special Lecture” on a topic clearly very 
close to his heart—one that he first heard about from his New York high school 
teacher, Mr. Bader. Mr. Bader told his (apparently bored) student Feynman how 
principles of maxima and minima apply to the trajectories of moving objects 
and in particular how the action principle of Maupertuis, Leibniz, and Hamilton 
(discussed in Section 3.3) applies to Newton’s mechanics, governed by F= ma. 

Professor Feynman, at the end of his lecture, notes that “a physicist, a student 
of Mr. Bader, in 1942 showed how this action principle applied to quantum 
mechanics.” That student was Feynman himself, who received the Nobel Prize for 
his insights, which also included the discovery of Feynman integrals. The moral 
here is pay attention to your teachers—especially the best ones! 

We include the first part of Feynman’s lecture here and more of it in the 
Internet supplement; see Volume II, Lecture 19, of the Feynman Lectures on 
Physics for the entire lecture. 


The Principle of Least Action, by Richard Feynman 

When | was in high school, my physics teacher—whose name was Mr. 
Bader—called me down one day after physics class and said, “You look 
bored; | want to tell you something interesting.” Then he told me something 
which I found absolutely fascinating, and have, since then, always found 
fascinating. Every time the subject comes up, | work on it. In fact, when | 
began to prepare this lecture | found myself making more analyses on the 
thing. Instead of worrying about the lecture, | got involved in a new 
problem. The subject is this—the principle of least action. 

Mr. Bader told me the following: Suppose you have a particle (in a 
gravitational field, for instance) which starts somewhere and moves to 
some other point by free motion—you throw it, and it goes up and comes 
down (see Figure 4.1.7). 


Actual motion 


Here 


t 


figure 4.1.7 


It goes from the original place to the final place in a certain amount of 
time. Now, you try a different motion. Suppose that to get from here to 
there, it went like this (see Figure 4.1.8), but got there in just the same 
amount of time. 

Then he said this: “If you calculate the kinetic energy at every moment 
on the path, take away the potential energy, and integrate it over the time 
during the whole path, you'll find that the number you'll get is bigger than 
that for the actual motion.” 
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Imagined 
motion 


Here 


t 


figure 4.1.8 


In other words, the laws of Newton could be stated not in the form 
F = ma but in the form: The average kinetic energy less the average 
potential energy is as little as possible for the path of an object going from 
one point to another. 

Let me illustrate a little better what this means. If you take the case of the 
gravitational field, then if the particle has the path x(t) (lets just take one 
dimension for a moment; we take a trajectory that goes up and down 
and not sideways), where x is the height above the ground, the kinetic 
energy is 5m(dx/dt)?, and the potential energy at any time is mgx. Now | 
take the kinetic energy minus the potential energy at every moment along 
the path and integrate that with respect to time from the initial time to the 
final time. Lets suppose that at the original time ft, we started at some 
height and at the end of the time f we are definitely ending at some other 
place (see Figure 4.1.9). 


> >< 


l 
l 
J 
t t2 


figure 4.1.9 


Then the integral is 


figure 4.1.11 Max Planck 
(1858-1947). 
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The actual motion is some kind of curve—it’s a parabola if we plot against 
the time—and gives a certain value for the integral. But we could imagine 
some other motion that went very high and came up and down in some 
peculiar way (see Figure 4.1.10). 


>t 


| 
ti t2 
figure 4.1.10 


We can calculate the kinetic energy minus the potential energy and 
integrate for such a path... or for any other path we want. The miracle is 
that the true path is the one for which that integral is least. 


MAX PLANCK AND THE PRINCIPLE OF LEAST ACTION. Max Planck (see 

Figure 4.1.11), one of the greatest scientists of the modern era and the discoverer 
of the “quantization” of nature, was also a profound believer in the mathematical 
design of the universe, and in particular the principle of least action. He argued 
that the universality of this principle was evidence of a divine creator, and the 
structure of the universe a result of divine reason. On June 29, 1922, on “Leibniz 
Day” in Berlin, Germany, just a few years after World War I, with all its terrible 
carnage, Planck delivered an address honoring this great scholar and his 
discovery of the principle of least action. 

In the following paragraphs we excerpt some of Planck’s remarks. 

Modern science, in particular under the influence of the development of the 
notion of causality, has moved far away from Leibniz’s teleological point of view. 
Science has abandoned the assumption of a special, anticipating reason, and it 
considers each event in the natural and spiritual world, at least in principle, as 
reducible to prior states. But still we notice a fact, particularly in the most exact 
science, which, at least in this context, is most surprising. Present-day physics, as 
far as it is theoretically organized, is completely governed by a system of 
space-time differential equations which state that each process in nature is 
totally determined by the events which occur in its immediate temporal and 
spatial neighborhood. 

This entire rich system of differential equations, though they differ in detail, 
since they refer to mechanical, electric, magnetic, and thermal processes, is 
now completely contained in a single dictum—fthe principle of least action. This, 
in short, states that, of all possible processes, the only ones that actually occur 
are those that involve minimum expenditure of action. As we can see, only a 
short step is required to recognize in the preference for the smallest quantity of 
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figure 4.1.12 Trajectory of the 
Genesis spacecraft from the 
earth to a periodic orbit about a 
million and a half kilometers from 
earth and the interesting return 
trajectory to earth. 


Moon's Return 
orbit trajectory 
to earth 


Transfer 
to halo 


500,000 


, Iln) 


action the ruling of divine reason, and thus to discover a part of Leibniz’s 
teleological ordering of the universe.? 

In present-day physics the principle of least action plays a relatively minor 
role. It does not quite fit into the framework of present theories. Of course, 
admittedly it is a correct statement; yet usually it serves not as the foundation of 
the theory, but as a true but dispensable appendix, because present theoretical 
physics is entirely tailored to the principle of infinitesimal local effects, and sees 
extensions to larger spaces and times as an unnecessary and uneconomical 
complication of the method of treatment. Hence, physics is inclined to view the 
principle of least action more as a formal and accidental curiosity than as a 
pillar of physical knowledge. 


Real-Life Trajectories 


Interesting paths in IR? that obey Newton's second law occur in our own solar system 
and are used by NASA to plan space missions. One such mission, the G enesis Discovery 
Mission, launched from earth A ugust 8, 2001, has a particularly interesting trajectory, 
as shown in Figure 4.1.12. More information about this trajectory and the mission 
objectives can be found at http://genesismission.jpl.nasa.gov/. 

The points denoted Lı and L2 in this figure denote places of balance (discovered 
by Euler) between the earth and the sun. A motionless spacecraft positioned there 
will remain there. There are periodic orbits about these points that we have (loosely) 
called halo orbits. The main dynamics of the spacecraft is governed by the pull of both 
the earth and the sun (and to a very small extent the moon) on the spacecraft. This 
is thus part of the famous three-body problem studied and made famous by Poincaré 
around 1890. 


For more information and history, consult S. Hildebrandt and A. J. Tromba, The Parsimonious U ni- 
verse: Shape and Form in the Natural World, Springer-Verlag, New York/Berlin, 1995. 

3For more information about Poincaré, see F. Diacu and P. Holmes, Celestial Encounters. The Origins 
of Chaos and Stability, Princeton U niversity Press: Princeton, NJ, 1996. 
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In Exercises 1 to 4, find the velocity and acceleration vectors and the equation of the tangent line for each of the following 


curves, at the given value of t. 
L r(t) = (cost)i+ (sin 2t)j, att = 0 
2. dt) =(t sint, t cost, /3t), att = 0 
In Exercises 5 to 8, let a(t) = ei + (sint)j + t3kand @(t) = 


eti+ (cost)j — 2t3k Find each of the stated derivatives in two 
different ways to verify the rules in the box preceding Example 1. 


d 
5. glat + lt] 
6 Haia] 
at Qt)-@ 
7. “fe(t) x ett)! 
. at cq x © 


a alt -Rel +alt)]} 


9. Consider the helix given by dt) = (a cost, a sint, bt). 
Show that the acceleration vector is always parallel to 
the xy plane. 


10. Prove the dot product rule. 


IL Determine which of the following paths are regular: 


(a) et) = (cost, sint, t) 
(b) et) = (t3, t5, cost) 
(c) dt) = (t2, et, 3t + 1) 


12. Let vand adenote the velocity and acceleration vectors 
of a particle moving on a path c Suppose the initial 
position of the particle is (0) = (3, 4, 0), the initial 
velocity is WO) = (1, 1, —2), and the acceleration 
function is at) = (0, 0, 6). Find wt) and dt). 


13. The acceleration, initial velocity, and initial position of a 
particle traveling through space are given by 


a(t) = (2, —6, —4), 0) =(—5,1,3), (0) = (6, —2, 1). 


The particle’s trajectory intersects the yz plane exactly 
twice. Find these two intersection points. 


14. The acceleration, initial velocity, and initial position of a 
particle traveling through space are given by 


at) = (—6,2,4), WO) =(2,-5,1), r({0) = (—3, 6, 2). 


3. r(t) = V2ti¢ej+etk att =0 


4 dt) =ti+tj+ 303k att =9 


The particle's trajectory intersects the yz plane exactly 
twice. Find these two intersection points. 


15. If r(t) = 6ti+ 3t2j + t3k, what force acts on a particle 
of mass m moving along ratt = 0? 


16. Leta particle of mass 1 gram (g) follow the path in 
Exercise 1, with units in seconds and centimeters. W hat 
force acts on it att = 0? (Give the units in your answer.) 


17. A body of mass 2 kilograms moves on a circle of radius 
3 meters, making one revolution every 5 seconds. Find 
the centripetal force acting on the body. 


18. Find the centripetal force acting on a body of mass 
4 kilograms (kg), moving on a circle of radius 10 meters 
(m) with a frequency of 2 revolutions per second (rps). 


19. Show that if the acceleration of an object is always 
perpendicular to the velocity, then the speed of the 
object is constant. (HINT: See Example 1.) 


20. Show that, at a local maximum or minimum of ||r(t) ||, 
the vector r’(t) is perpendicular to r(t). 


2L A satellite is in a circular orbit 500 miles above the 
surface of the earth. W hat is the period of the orbit? 
(You may take the radius of the earth to be 4000 miles, 
or 6.436 x 10° meters.) 


22. What is the acceleration of the satellite in Exercise 21? 
The centripetal force? 


23. Find the path csuch that d0) = (0, —5, 1) and 
c(t) = (t, et, t2). 


24. Let cbe a path in R? with zero acceleration. Prove that € 
is a straight line or a point. 


25. Find paths qt) that represent the following curves or 


228 Vector-Valued Functions 


trajectories. 


(a) {(x, y) | y =e*} 
(b) {(x, y) 14x2 +y? =1} 


Prove that 


T imdi x v(t)] = dt) x F(dt)) 


(c) A straight line in R? passing through the origin and (i.e., “rate of change of angular momentum = torque”). 
the point (a, b, c) W hat can you conclude if F(e(t)) is parallel to dt)? Is 
(d) {(x, y) | 9x? + 16y? = 4} this the case in planetary motion? 
26. Let dt) bea path, v(t) its velocity, and a(t) the 27. Continue the investigations in Exercise 26 to prove 
acceleration. Suppose F is aC! mapping of R? to R3, Kepler's law that a planet moving under the influence of 
m > 0, and F(dt)) = malt) (Newton's second law). gravity about the sun does so in a fixed plane. 


4.2 Arc Length 


example | 


Definition of Arc Length 


What is the length of a path qt)? Because the speed ||¢(t)|| is the rate of change of 
distance traveled with respect to time, the distance traveled by a point moving along the 
curve should be the integral of speed with respect to the time over the interval [to, t1] of 
travel time; that is, the length of the path, also called its arc length, is 


th 
Lid = e(t) I] dt. 


There is the question as to whether or not this formula actually corresponds to the 
truearc length. For example, suppose we take a curve in space and glue a string tightly to 
it, cutting the string so it exactly fits the curve. If we then remove the string, straighten it 
out, and measure it with a straight edge, we surely should obtain the length of the curve. 
That our formula for arc length agrees with such a process is justified in the supplement 
at the end of this section. 


Arc Length The ary the path dt) = (x(t), y(t), z(t)) for to < t < t, is 


Ues [° VEDER TOP 


The arc length of the path qt) = (r cost, r sint), for t lying in the interval [0, 27]; 
that is, for 0 <t < 2x, is 


27 
L(Q = {=r sint)? + (r cost)? dt = 2zr, 
0 


which is the circumference of a circle of radius r. If we had allowed 0 < t < 
4x, we would have obtained 4zr, because the path traverses the same circle twice 
(Figure 4.2.1). 


example 2 


solution 


example 3 


solution 
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» figure 4.2.1 The arc length of a circle traversed 
twice is 4rr. 
A 


For planar curves, we omit the z’(t) term, as in Example 1. Here is an example 
in R3. 
Consider the point with position function 
dt) = (t — sint, 1 — cost), 


which traces out the cycloid discussed in Section 2.4 (see Figure 2.4.6). Find the velocity, 
the speed, and the length of one arch. 


The velocity vector is c(t) = (1 — cost, sint), so the speed of the point dt) is 


I|¢(t) || = va — cost)? + sin?t = /2 — 2 cost. 


Hence, dt) moves at variable speed although the circle rolls at constant speed. F urther- 
more, the speed of œt) is zero when t is an integral multiple of 27. At these values of 
t, the y coordinate of the point dt) is zero, and so the point lies on the x axis. The arc 
length of one cycle is 


27 Qn = 
Lia = f V2—2 costat =2 f Ps = 
0 0 


oF a ct t 
= 2 | sin zit (because 1= cost = 2 sin? — and sin— > 0 on [0, 2m) 
0 


2 27 
t 
= 4( -cos 5) 


If acurveis made up of a finite number of pieces, each of whichis C ! (with bounded 
derivative), we compute the arc length by adding the lengths of the component pieces. 
Such curves are called piecewise C 1. Sometimes we just say “piecewise smooth.” 


2x 
= 8. 
0 A 


A billiard ball on a pool table follows the path c [—1, 1] + R?, defined by dt) = 
(x(t), y(t), z(t)) = (ti, It — 11). Find the distance traveled by the ball. 


This path is not smooth, because x(t) = |t] is not differentiable at 0, nor is y(t) = 
t — 5| differentiable at F, However, if we divide the interval [—1, 1] into the pieces 
[—1, 0], [0, 4], and [4, 1], we see that x(t) and y(t) have continuous derivatives on each 
of the intervals [—1, 0], [0, 4], and [ż4, 1]. (See Figure 4.2.2.) 
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example 4 


solution 


$ e(—1) 


? figure 4.2.2 A piecewise smooth path. 
¢(0) e(1) 


N= = Nue N 


Nee 


ea/2 ! 


On [—1, 0], x(t) = —t, y(t) = —t + 4, so |e(t)i| = V2. Hence, the arc length of 
cbetween —1 and O is f°, /2dt = V2. Similarly, on [0, 3], x(t) = t, y(t) = -t +3, 
and again ||¢(t) || = ./2, so that the arc length of cbetween 0 and 5 is 5/2. Finally, on 
[5, 1] we have x(t) = t, y(t) = t — 5, and the arc length of cbetween 5 and 1 is 5/2. 
Thus, the total arc length of cis 2/2. Of course, we can also compute the answer as the 
sum of the distances from d —1) to d0) to d4) tod1). A 


Find the arc length of (cost, sint, t?), 0 <t < x. 


Thepath dt) = (cost, sint, t?) has the velocity vector given by v = (—sint, cost, 2t). 


Because 
2 
Ivii = v sin?t + cos? t + 4t2 = \/1 4 4t2 = 24/t2 + (5) 
the arc length is 
Sa 1 2 
L(g =f 24/t2 + (5) dt. 
0 2 


This integral may be evaluated using the following formula from the table of integrals: 


[VER eax = Sevier ta log(x + Vx? +a?)] +C. 
Thus, 


a OOO 
saegal yea) ata) 


= 5 144r? + 7 log (20 + V14 4r?) ~ 10.63. 


As a check on our answer, we may note that the path c connects the points (1, 0, 0) and 
(—1, 0, x”). The distance between these points is V4 + 2? ~ 3.72, which is less than 
10.63, as it should be. A 


The Differential of Arc Length 


The arc-length formula suggests that we introduce the following notation, which will 
be useful in Chapter 7 in our discussion of line integrals. 


figure 4.2.3 Differential of arc 
length. 
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Arc-Length Differential An infinitesimal displacement of a particle follow- 
ing a path qt) = x(t)i+ y(t)j + z(t)k is 


r A ox, chy, Gz 
ds=dxi+dyj+dzk= ( i+ i+ Ek) dt 
and its length 


2 2 2 
is = Verr TE- | (#) +(3) +(%) dt 


dt dt 
is the differential of arc length. See Figure 4.2.3. 


These formulas help us remember the arc-length formula as 
tı 
arc length = ds. 
to 


As we have done before with such geometric concepts as length and angle, we can 
extend the notion of arc length to paths in n-dimensional space. 


Arc Length in R” Letec [to, tı] + R” be a piecewise C1 path. Its length is 
defined to be 


th 
Lid =| let) | dt. 


The integrand is the square root of the sum of the squares of the coordinate 
functions of c(t): If 


Ct) = (xXi(t), X2(t), ..., Xn(t)), 
then 


th 
Ld =f s/loalth)? + (g(t)? +--+ Og (OP at 
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example 5 


solution 


example 6 


Find the length of the path qt) = (cost, sint, cos2t, sin 2t) in R*, defined on the 
interval from 0 to zr. 


We have c(t) = (—sint, cost, —2 sin 2t, 2 cos2t), and so 


lelt) = v sin2t + cos? t +4 sin? 2t + 4 cos? 2t = VI F4 = V5, 


a constant, so the length of the path is 


f NSde= vin. 


It is common to introduce the arc-length function s(t) associated to a path dt) 
given by 


t 
s(t) = | ||¢(u)|| du, 
a 
so that (by the fundamental theorem of calculus) 
s'(t) = |}e(t) | 


and 


Consider the graph of a function of one variable y = f(x) for x in the interval [a, b]. 
We can consider it to be a curve parametrized by t = x, namely, dx) = (x, f (x)) for 
x ranging from a to b. The arc-length formula gives 


b 
L(g =d VI+ LEO dx, 


which agrees with the formula for the length of a graph from one-variable calculus. A 


Justification for the Arc-Length Formula 


The following discussion assumes an acquaintance with the definite integral defined 
in terms of Riemann sums. If your background in this topic needs reinforcement, the 
material may be postponed until after Chapter 5. 

In R? there is another way to justify the arc-length formula based on polygonal 
approximations. We partition the interval [a, b] into N subintervals of equal length: 


a=t)<t <- <t =b; 
b—a ; 
fa -f =- for 0<i<N-1. 


We then consider the polygonal line obtained by joining the successive pairs of 
points dti), dt ,1) forO <i <N — 1. This yields a polygonal approximation to cas 
in Figure 4.2.4. By the formula for distance in R?, it follows that the line segment from 


figure 4.2.4 A path c may be 
approximated by a polygonal 
path obtained by joining each 
e(f) to celh) by a straight line. 
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e(ty) = e(b) 


eee eo 8 
a=ty t tp ts ty =b 
c(to) = c(a) 


cati) to dti+ı) has length 


ldt) — ti) |] = vix (ti) — x(t)? + Ly(tiga) — y(t) PP + [z(tya) — z(t )1?, 


where qt) = (x(t), y(t), z(t)). Applying the mean-value theorem to x(t), y(t), and 
z(t) on [t;, ti+1], we obtain three points t*, t;**, and t*** such that 


X (tis) — X(t) = x(t") (ti — ti), 
y(ti4a) — y(t) = y/(t*) (tia il, 


and 
Z(t) — z(t) = z(t) (ta — ti). 


Thus, the line segment from dti) to et.) has length 


JOE FYE EE ty — ti). 


Therefore, the length of our approximating polygonal line is 


N-1 
Sy = > VIP FPP + Pb t). 


i=0 


ASN — ov, this polygonal line approximates the image of cmore closely. Therefore, 
we define the arc length of cas the limit, if it exists, of the sequence Sy as N — oo. 
Because the derivatives x’, y’, and z’ are all assumed to be continuous on [a, b], we can 
conclude that, in fact, the limit does exist and is given by 


b 
limit Sy = / VO + ly (HP + O dt. 


(The theory of integration relates the integral to sums by the formula 


b N-1 
f F(t) dt = limit SE FO t), 
à +00 a 
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where to, . 


.., ty is a partition of [a, b], t* € [t;, ti41] is arbitrary, and f is a continuous 


function. Here we have possibly different points të, t;*, and t;**, and so this formula 


must be extended slightly.) 


exercises 


Find the arc length of the given curve on the specified interval in Exercises 1 to 6.4 


L 
2 
3. 


4. 


5. 


6. 
7. 
8. 


(2 cost, 2 sint, t), for 0 < t < 27 
(1, 3t?, t3), foro <t <1 


(sin 3t, cos 3t, 2t3/2), foro <t <1 


(tsa 20257 +7, 5) fort <1 <2 


(t,t, t2), forl <t <2 


(t,t sint, t cost), for0 <t <x 
Find the arc length of qt) = (t, |t|) for —1 <t <1. 


Recall from Section 2.4 that a rolling circle of radius R 
traces out a cycloid, which can be parametrized by 

dt) = (Rt — R sint, R — R cost). One arch of the 
cycloid is completed from t = 0 to t = 2x. Show that 
the length of this arch is always 4 times the diameter of 
the rolling circle. 


Let C be the line segment connecting the point 

p= (1, 2, 0) to the point q = (0, 1, —1). 

(a) Find a curve dt): [a, b] —> R? that traces out C. 
(b) Find the arc length of dt). 

(c) Find ||p— ql. 


Compute the length of the curve 
dt) = (log( vt), V3t, 3t?) ford <t <2. 


Find the length of the path dt), defined by 
dt) = (2 cost, 2 sint, t), if 0 <t < 2x and 
ct) =(2,t — 2x, t), if 27 <t < 4r. 


4Several of these problems make use of the formula 
J x2+a2dx = iky +a? +a? log(x + \/x? +a?)] EKG 


from the table of integrals in the back of the book. 


12. Let cbe the path dt) = (t,t sint, t cost). Find the 


arc length of c between the two points (0, 0, 0) and 
(xr, 0, =x). 


Let cbe the path qt) = (2t, t2, logt), defined fort > 0. 
Find the arc length of c between the points (2, 1, 0) and 
(4, 4, log 2). 


14. Thearc-length function s(t) for a given path œt), 


defined by s(t) = f |¢(z) || dz, represents the distance 
a particle traversing the trajectory of cwill have traveled 
by time t if it starts out at time a; that is, it gives the 
length of c between da) and qt). Find the arc-length 
functions for the curves æ(t) =(cosht, sinht, t) and 
A(t) = (cost, sint, t), witha = 0. 


Let et) be a given path, a < t < b. Lets =a(t) bea 

new variable, where a is a strictly increasing C 1 

function given on [a, b]. For each s in[a@(a), a(b)] 

there is a uniquet with a(t) = s. Define the function 

d[ (a), a(b)] > R? by ds) = dt). 

(a) Argue that the image curves of cand dare the 
same. 

(b) Show that cand dhave the same arc length. 

(c) Lets = a(t) = Ji |¢(z) || dr. Define das above 
by d(s) = dt). Show that 


|g |-2 


The path s +> ds) is said to be an arc-length 
reparametrization of € (see also Exercise 17). 


16. Lete [a, b] —> R? bean infinitely differentiable path 
(derivatives of all orders exist). Assume c(t) 4 Ofor 
any t. The vector ¢(t) /|}¢(t) || = T(t) is tangent to cat 
dt), and, because ||T(t) || = 1, T is called the unit 
tangent to c 


(a) Show that T’(t) - T(t) = 0. [HinT: Differentiate 
T(t) - T(t) =1.] 


(b) Write down a formula for T’(t) in terms of c 


17. (a) A path qs) is said to be parametrized by arc length 
or, what is the same thing, to have unit speed if 
\|¢(s) || = 1. For a path parametrized by arc length 
on [a, b], show that l(c) = b — a. 


(b) The curvature at a point ds) ona path is defined by 
k = ||T’(s)|| when the path is parametrized by arc 
length. Show that k = ||¢’(s) ||. 

If cis given in terms of some other parameter t and 
c(t) is never Q show that 

k = e(t) x e(t) 11e (t). 

Calculate the curvature of the helix 

qt) = (1//2)(cost, sint, t). (This cis a scalar 
multiple of the right-circular helix.) 


— 
cy 


(d 


18. Show that any line I(t) = x + tv, where vis a unit 
vector, has zero curvature. 


19. Consider the parametrization of the unit circle given by 
dt) = (cost, sint). 


(a) Verify that cis parametrized by arc length. 
(b) Show that the curvature k of cis constant. 


20. If T(t) 4 O it follows from Exercise 16 that 
N(t) = T’(t)/|| T’(t) || is normal (i.e., perpendicular) to 
T(t); Nis called the principal normal vector. Let a third 
unit vector that is perpendicular to both T and N be 
defined by B = T x N; Bis called the binormal vector. 
Together, T, N, and B form a right-handed system of 
mutually orthogonal vectors that may be thought of as 
moving along the path (Figure 4.2.5). Show that 


(c) dB/dtisascalar multiple of N. 


2L If ds) is parametrized by arc length, we use the result of 
Exercise 20(c) to define a scalar-valued function t, 
called the torsion, by 
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Exercises 16, 17, and 20-23 develop some of the classic differential geometry of curves. 


X 


figure 4.2.5 The tangent T, principal normal 
N, and binormal B. 


(a) Show that r = [€ (s) x e’(s)]-€”(s) /|le"(s) 112. 
(b) Show that if cis given in terms of some other 
parameter t, 
ee [e(t) x e/(t)]-e”(t) 
IIe (t) x e(t)? 
Compare with Exercise 17(c). 


(c) Compute the torsion of the helix 
dt) =(1//2)(cost, sint, t). 


22. Show that if a path lies in a plane, then the torsion is 
zero. Do this by demonstrating that B is constant and is a 
normal vector to the plane in which clies. (If the torsion 
is not zero, it gives a measure of how fast the curve is 
twisting out of the plane of T and N.) 


23. (a) Use the results of Exercises 17, 20, and 21 to prove 
the following Frenet formulas for a unit-speed 


curve: 
of = kN; an = —kT + cB; 1B = —TtN. 
ds ds ds 


(b) Reexpress the results of part (a) as 


TE 


for a suitable vector w. 


24. In special relativity, the proper time of a path 
c [a, b] > R4 with components given by 
A) = (x(a), y(A), 2(A), t(A)) is defined to be the 
quantity 


b 
J Vx (AP = yA = (2a)? + lta da, 
a 
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where c is the velocity of light, a constant. In 25. The early Greeks knew that a straight line was the 
Figure 4.2.6, show that, using self-explanatory notation, shortest possible path between two points. Euclid, in his 
the “twin paradox inequality” holds: book Optics, stated the “principle of the reflection of 


light”— that is, light traveling in a plane travels ina 
straight line, and when it is reflected across a mirror, the 
angle of incidence equals the angle of reflection. 

The Greeks could not have had a proof that straight 
lines provided the shortest path between two points 
because they, in the first place, had no definition of the 
length of a path. They saw this property of straight lines 
as more or less “obvious.” 

Using the justification of arc length in this section 
and the triangle inequality of Section 1.5, argue that if œ 
is the straight-line path co(t) = tP + (1 — t)Q between P 
and Q in R3, then 


proper time (AB) +proper time (BC) <proper time (AC). 


L(@) < L(g 


for any other path cjoining P and Q. 


figure 4.2.6 The relativistic triangle inequality. 


4.3 Vector Fields 


The Concept of a Vector Field 


In Chapter 2 we introduced a particular kind of vector field, the gradient. In this section 
we study general vector fields, discussing their geometric and physical significance. 


Vector Fields A vector field in R" isa map F: A c R" —> R” that assigns to 
each point xin its domain A a vector F(x). If n = 2, F is called a vector field in 
the plane, and if n = 3, F is a vector field in space. 


Picture F as attaching an arrow to each point (Figure 4.3.1). By contrast, a map 
f:A C R” — R that assigns a number to each point is a scalar field. A vector field 


F(x) 
if Lr figure 4.3.1 A vector field F assigns a vector 
F(x) to each point x of its domain. 


example | 


example 2 


example 3 
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F(x, y, z) on R? has three component scalar fields F, Fz, and F 3, so that 
F(x, y, z) = (Filx, y, z), Fa(X, y, Z), Fa(x, y, 2)). 


Similarly, a vector field on R" has n components F1, ..., Fn. If each component is a 
Ck function, we say the vector field F is of class C *. Vector fields will be assumed to 
be at least of class C + unless otherwise noted. In many applications, F(x) represents a 
physical vector quantity (force, velocity, etc.) associated with the position x, as in the 
following examples. 


The flow of water through a pipe is said to be steady if, at each point inside the pipe, the 
velocity of the fluid passing through that point does not change with time. (Note that 
this is quite different from saying that the water in the pipe is not moving.) Attaching to 
each point the fluid velocity at that point, we obtain the velocity field V of the fluid (see 
Figure 4.3.2). Notice that the length of the arrows (the speed), as well as the direction 
of flow, may change from point to point. 


figure 4.3.2 A vector field describing 
J) the velocity of flow in a pipe. A 


S 


Some forms of rotary motion (such as the motion of particles on a turntable) can be 
described by the vector field 


V(x, y) = -yi- xj. 
See Figure 4.3.3, in which we have shown instead of V the shorter vector field iV So 


that the arrows do not overlap. This is a common convention in drawing pictures of 
vector fields. 


fil Fe 3 
ieee. ee 
iA) pe) ah ee 4 
L\vsc tert 
4 heer rs A) 
+ ere eS 


In the plane, R?, let the vector field x be defined by 


yi xj y x 
Vix, y) = Xy x2+y2 esa =) 
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figure 4.3.4 The vector field 
describing circular flow in a tub. 


example 4 


example 5 


(except at the origin, where V is not defined). This vector field is a good approximation 
to the planar part of the velocity of water flowing toward a hole in the bottom of a tub 
(Figure 4.3.4). Notice that the velocity becomes larger as you approach the hole. A 


Gradient Vector Fields 


In Section 2.6 we introduced the gradient of a function by 


of of of 
Vf(x,y,z) = —(x, y,z)i+ — (x, y, Zj + —(% y, zk 
(yz) = Ky i+ yí y 2j + (x y, 2) 
Now we want to think of this as an example of a vector field— it assigns a vector to each 
point (x, y, z). As such, we refer to V f as a gradient vector field. Gradient fields come 
up in a variety of situations, as the next two examples show. 


A piece of material is heated on one side and cooled on another. T he temperature at each 
point within the body is described at a given moment by a scalar field T (x, y, z). The 
flow of heat may be marked by a field of arrows indicating the direction and magnitude 
of the flow (Figure 4.3.5). This energy or heat flux vector field is given by J = —kVT, 
where k > 0 is a constant called the conductivity and VT is the gradient of the real- 
valued function T . Level sets of T are called isotherms. N ote that the heat flows from 
hot regions toward cold ones, since —VT points in the direction of decreasing T . 


Heat flux vector 


figure 4.3.5 A vector field describing 
the direction and magnitude of heat 
flow. 


A 


The force of attraction of the earth on a mass m can be described by a vector field called 
the gravitational force field. Place the origin of a coordinate system at the center of the 
earth (assumed spherical). A ccording to Newton's law of gravity, this field is given by 


_mMG 


F=-—, 


r, 


where r(x, y, z) = (x, y, z), andr = ||r|| (See Figure 4.3.6). The domain of this vector 
field consists of those r for which ||r|| is greater than the radius of the earth. As we saw 
in Example 6, Section 2.6, F is a gradient field, F = — VV , where 
_mMG 

r 


V = 


is the gravitational potential. N ote again that F points in the direction of decreasing V . 
Writing F in terms of components, we see that 


—mMG —mMG —mMG 
F(x, y,z) = 73 X, "E y, 3 z). 
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} 


m Da M 

NF a 

~, wee 
a y figure Te The vector field F given by 
7 Newton's law of gravitation. 
a 

e7 ; A < 

a Z T 


pty a 


example 6 | According to Coulomb’s law, the force acting on a charge e at position r due to a charge 
Q at the origin is 


Pa 
ma 


eQe 


where V = eQe/r and € is a constant that depends on the units used. For Qe > 0 
(like charges) the force is repulsive [Figure 4.3.7(a)], and for Qe < 0 (unlike charges) 
the force is attractive [Figure 4.3.7(b)]. Because the potential V is constant on the 
level surfaces of V , they are called equipotential surfaces. N ote that the force field is 
orthogonal to the equipotential surfaces (the force field is radial and the equipotential 
surfaces are concentric spheres). 


\ | \ | 
— 22 — fo 
eg Wi 
a 
TE as. 


figure 4.3.7 The vector fields associated with (a) like charges 
(Qe > 0), and (b) unlike charges (Qe < 0). A 


The next example shows that not every vector field is a gradient. 


example 7 | Show thatthe vector field Von R? defined by V(x, y) = yi— xj is nota gradient vector 
field; that is, there is no C 1 function f such that 


of. əf. 
V(x, y) = vf(x, y) = TU 3y) 
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solution 


Suppose that such an f exists. Then əf /ax = y and af /ay = —x. Because these are C ! 
functions, f itself must have continuous first- and second-order partial derivatives. B ut, 
əf /ax dy = —1, and 3?f /ay ax = 1, which violates the equality of mixed partials. 
Thus, V cannot bea gradient vector field. A 


Conservation of Energy and Escaping the Earth's 
Gravitational Field 
Consider a particle of mass m moving in a force field F that is a potential field. That is, 


assume F = —VV for a real-valued function V , and that the particle moves according 
to F=ma Thus, if the path is r(t), then 


mir(t) = —VV (r(t)). (1) 


A basic fact about such motion is the conservation of energy. The energy E of the 
particle is defined to be the sum of the kinetic and potential energies, defined as 


E = Emitt? +V (rtt). (2) 


The principle of conservation of energy states that if N ewton’s second law holds, then E 
is independent of time; that is, dE /dt = 0. The proof of this fact is a simple calculation; 
we use equation (2), the chain rule, and equation (1): 


É -mr t+(VV)- 
=f-(-VV +VV)=0. 


Escape Velocity 


As an application of conservation of energy, we compute the velocity required for a 
rocket to escape the earth’s gravitational influence. Assume the rocket has mass m and 
is at a distance Ro from the center of the earth (or another planet) when its escape 
velocity ve has been reached, and that it will coast thereafter. The energy at this time is 


1, mMG 


5 Us a (3) 


By conservation of energy, Eo will equal the energy at a later time, which we write as 
Eo = E = mv — —_, (4) 


where v is the velocity and R is the distance from the center of the earth (or the other 
planet). What we mean by the term escape velocity is that ve is chosen such that the 
rocket gets to great distances, at which time it is barely moving. That is, v is close to 
zero and R is very large. Thus, from equation (4), we seethat E = 0 and hence Ey = 0; 
solving Eo = 0 for ve using equation (3) gives: 


2M G 
Ve = 4| —. 


Ro 
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Now GM /Ré is exactly g, the acceleration due to gravity at the distance Ro from the 
center of the planet. Thus, we can write: 


Ve = 2gRo. 


For the earth, if the escape velocity were to be achieved at the surface of the earth (of 
course, this is a bit unrealistic), this would give 


ve = y/2- 9.8 m/s?- 6,371,000 m = 11,127 mis. 
However, this is a good approximation to the velocity that a satellite in low earth orbit 
needs in order to escape the earth’s gravitational field. 


Flow Lines 


An important concept related to general (not necessarily gradient) vector fields is that 
of a flow line, defined as follows. 


Flow Lines If F is a vector field, a flow line for F is a path dt) such that 


c(t) = F(dt)). 


Inthe context of Example 1, a flow line is the path followed by a small particle suspended 
inthe fluid (Figure 4.3.8). Flow lines arealso appropriately called streamlines or integral 
curves. 

Geometrically, a flow line for a given vector filed F is a curve that threads its way 
through the domain of the vector field in such a way that the tangent vector of the curve 
coincides with the vector field, as in Figure 4.3.9. 


Velocity vector Flow line 


Nie figure 4.3.8 The velocity vector of 


on A 
fA 
l | | figure 4.3.9 A flow line threading its way 


ee f through a vector field in the plane. 
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example 8 


solution 


Historical Note 


A flow line may be viewed as a solution of a system of differential equations. Indeed, 
we can write the definition c(t) = F(dt)) as 


x'(t) = P (x(t), y(t), z(t)), 
y(t) = Q (x(t), y(t), z(t)), 
z'(t) = R(x(t), y(t), z(t)), 


where dt) = x(t)i+ y(t)j + z(t)k, and where 
F=Pi+Qj+Rk 


You probably learned about such systems in courses on differential equations, but we 
are not presuming such a course has been taken. 


Show that the path dt) = (cost, sint) is a flow line of the vector field F(x, y) = 
—yi-+ xj. Can you find others? 


We must verify that c(t) = F(dt)). The left side is (—sint)i+ (cost)j, while the right 
side is F(cost, sint) = (—sint)i+ (cost)j, so we have a flow line. As suggested by 
Figure 4.3.3, the other flow lines are also circles. They have the form 


dqt) = (r cos(t — to), r sin(t — to)) 


for constantsr and tọ. A 


In many cases, explicit formulas for flow lines are not available, so we must resort 
to numerical methods. Figure 4.3.10 shows some output from a program that computes 
flow lines numerically and plots them on the screen. 


figure 4.3.10 Computer-generated 
integral curves of the vector field 

Fox, y) = (sin Yi + (x? — yj. This figure was 
created using 3D-XplorMath, available 
from Richard Palais' Web site at 
http://3D-XplorMath.org. 


AN 
EAN 


The Field Concept 


The concept of a “field,” such as a vector field, has had an enormous impact on 
the development of conceptual frameworks for physics and engineering. It is 
truly one of the great breakthrough ideas in the history of human thought. It is the 
notion that allows us to describe, in a systematic way, influences on objects and 
between objects that are spatially separated. 


exercises 
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The idea of a field began with Newton’s concept of the gravitational field. In 
this case, the gravitational field describes the attractive influence of a group of 
bodies on each other. Similarly, the electric field produced by a charged object 
or group of objects creates, according to Coulomb's law, a force on another 
charged object. Using vector fields to describe these sorts of forces has led to a 
deeper understanding of attractive and repulsive forces in nature. 

However, it was the monumental discovery of the Maxwell field equations, 
which describe the propagation of electromagnetic energy, that cemented the 
concept of “field” in scientific thought. This example is particularly interesting 
because these fields can propagate. The contrast between the electromagnetic 
field that can propagate and the gravitational field that involves instantaneous 
action at a distance has caused great interest among philosophers of science. 

Einstein’s idea is that gravitation can be described in terms of the metric 
properties of space-time and that in this theory the associated field can also 
propagate, just like the electromagnetic field, thus providing profound 
philosophical evidence that Einstein’s version of gravity may be correct. These 
ideas have also led to modern efforts to detect gravitational waves. For a further 
discussion of Einstein’s work, see Section 7.7. 

The idea of a field is also used in engineering to describe elastic systems and 
interesting microstructural properties of materials. In modern theoretical physics, 
the field concept is used to describe elementary particles and is central to 
attempts by modern theoretical physicists to unify gravity with the quantum 
mechanical physics of elementary particles. It is impossible to imagine a 
modern theoretical framework that does not incorporate some sort of field 
concept as a central ingredient. 


In Exercises 1 to 8, sketch the given vector field or a small multiple of it. 


L F(x, y) = (2, 2) 
2. F(x, y) = (4, 0) 
3. F(x, y) = (x, y) 
@ F(x, y) = (—x, y) 
5. F(x, y) = (2y, x) 


6. F(x, y) = (y, —2x) 


_ X y 
H (a or) 


8 Fx, y= ( eee ) 
Very? Vitae 


In the following two exercises, match the given vector field with its pictorial description (see Figures 4.3.11 and 4.3.12). 


9. (a) V(x, y) =xiF yj 
(b) V(x, y) = yi- xj 


10. (a) Vix, y) = 


X . y . 
(b) V(x, y) = i+ j 
VÊ +yY yy? 
W here are these vector fields not defined? H ow are these 
vector fields related to those in Problem 9? 


X 


Y i 
Veer 


7 yx? + ye 


In Exercises 11 to 14, sketch a few flow lines of the given vector field. 


IL F(x, y) =(y,—-x) 
12. F(x, y) = (x, -y) 


13. F(x, y) = (x, x?) 
14. F(x, y,z) = (y, —x, 0) 
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(ii) 


In Exercises 15 to 18, show that the given curve dt) is a flow line of the given velocity vector field F(x, y, z). 


15. dt) = (e, log |t], 1/t), t 4 0; F(x, y, z) = 
(2x, z, =z?) 


16. dt) = (t?, 2t — 1, vT), t > 0; F(x, y, z) = 
(y +1,2, 1/22) 


17. dt) = (sint, cost, et); F(x, y, z) = ( y, —x, z) 
1 l 4 2 
18. dt) =(3 è ; pF y, 2) =(—32", y, —2°) 
19. Let F(x, y, z) = (x2, yx?, z+ zx) and 
dqt) = an 0, nm Show dt) isa flow line for F. 


20. Show that qt) = (a cost — b sint, a sint + b cost) isa 
flow line for F(x, y) = (—y, x) for all real values of a 
and b. 


2L (a) Let F(x, y,z) = (yz, xz, xy). Find a function 
f:R? —> R such that F = Vf. 
(b) Let F(x, y, z) = (x, y, z). Find a function 
f:R? —> Rsuch that F = V f. 


22. Let f(x, y) = x? + y?. Sketch the gradient vector field 
V f together with some level sets of f. How are they 
related? 


23. Show that it takes half as much energy to launch a 
satellite into an orbit just above the earth as it does to 
escape the earth. (Ignore the rotation of the earth.) 


24. Let dt) bea flow line of a gradient field F = —VV. 
Prove that V (e{t)) is a decreasing function of t. 


25. Suppose that the isotherms in a region are all concentric 
spheres centered at the origin. Prove that the energy flux 
vector field points either toward or away from the origin. 


26. Sketch the gradient field —VV for 
V (x, y) = (x + y)/(x? + y?) and the equipotential 
surface V =1. 

27. Let F(x, y, z) = (xeY, y2z2, xyz) and suppose 
dt) = (x(t), y(t), z(t)) is a flow line for F. Find the 
system of differential equations that the functions 
x(t), y(t), and z(t) must satisfy. 
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4.4 Divergence and Curl 


For each of the divergence and curl operations, we will make use of the del operator, 
defined by 


a a a 
vei -L L 
i T ae 


(a a a 
E Oxy! aX’ ' OXn f 


For functions of one variable, taking a derivative can be thought of as an operation 
or process; that is, given a function y = f(x), its derivative is the result of operating 
on y by the derivative operator d /dx. Similarly, we can write the gradient as 


ð af of 
Vf= lt Pel) ag 


In n-space 


oy ay 
for functions of two variables, and 


of af af 
ay Hi tk 


ə ə a 
vi =(i—+j—+k-\|f= 
(Sat =) 


for three variables. In operational terms, the gradient of f is obtained by taking the V 
operator and applying it to f. 


Definition of Divergence 
We define the divergence of a vector field F by taking the dot product of V with F. 


Divergence If F = Fii + F2j + F3k, the divergence of F is the scalar field 


OF oF OF 
pri aid T ale) 


divF=V-F= : 
əx ay. az 


Similarly, if F = (F1, ..., Fa) is a vector field on R", its divergence is 


n 
i OF; OF, OF OF , 
div F = = bis i 
2 OX; 0X1 + 0X2 T j OXn 
example | Compute the divergence of 
F = x?yi+ zj + xyzk 


solution 3 3 
div F = z” x?y) + —(z) + + (xyz) = 2xy +0 + xy = 3xy 
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example 2 


solution 


< 


figure 4.4.1 Flowing a region W 
along the flow lines of a vector 
field. 


Interpretation 


The divergence has an important physical interpretation. If we imagine F to be the 
velocity field of a gas (or a fluid), then div F represents the rate of expansion per unit 
volume under the flow of the gas (or fluid). If div F < 0, the gas (or fluid) is compressing. 
For a vector field F(x, y) = Fii-+ F 2j on the plane, the divergence 


oop ĉr OF 
ax ay 


measures the rate of expansion of area. 

This interpretation is explained graphically, as follows. Choose a small region W 
about a point X. For each point xin W, let x(t) be the flow line emanating from x 
The set of points x(t) describe how the set W flows after time t (see Figure 4.4.1). 

Call the region that results after time t has elapsed W (t), and let V(t) be its volume 
(or areain two dimensions). Then the relative rate of change of volume is the divergence; 
more precisely, 


1 d , 
mo a”) ri div F(X), 


with the approximation being more exact as W shrinks to %. A direct proof of this is 
given in the Internet supplement, but a more natural argument is given in Chapter 8, in 
the context of the integral theorems of vector calculus. 


Consider the vector field in the plane given by V(x, y) = xi. Relate the sign of the 
divergence of V with the rate of change of areas under the flow. 


We think of V as the velocity field of a fluid in the plane. The vector field V points to 
the right for x > 0 and to the left if x < 0, as we see in Figure 4.4.2. The length of V 
gets shorter toward the origin. As the fluid moves, it expands (the area of the shaded 
rectangle increases), so we expect div V > 0. Indeed, div V = 1. 


example 3 


example 4 


example 5 
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y 
A 
< ~< > > 
i Bi = = figure 4.4.2 This fluid is expanding. 
>- |] 
> X 
wat i aa a 
< -< a oa 
A 


The flow lines of the vector field F = xi+ yj are straight lines directed away from the 
origin (Figure 4.4.3). 


y 
i a S EF PaA 
Ten N OX a a 
AAA AO ey, a, 
i a ae i UAN NOS 


figure 4.4.3 The vector field F(x, y) = xi+ yj. 


If these flow lines are those of a fluid, the fluid is expanding as it moves out from the 
origin, so div F should be positive. In fact, 
f] 


0 


Consider the vector field F = —xi — yj. Here the flow lines point toward the origin 
instead of away from it (see Figure 4.4.4). Therefore, the fluid is compressing, so we 
expect (div F) < 0. Calculating, we see that 


V-F=—(=x) + —(-y) =-1-1=-2 <0. 


As we saw in the last section, the flow lines of F = —yi+ xj are concentric circles about 
the origin, moving counterclockwise (see Figure 4.4.5). From this figure, it appears that 
the fluid is neither compressing nor expanding. This is confirmed by calculating 

ð ð 


ES a ee i 
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example 6 | Some flow lines of F = xi — yj are shown in Figure 4.4.6. Here our intuition about 
expansion or compression is less clear. H owever, it is true that the shaded regions shown 
have the same area, and we calculate that 


a a 
V-F= —x+ —(-y) =1+4+(-1) =0 
at ay! y) =1+(-1) 
y Fluid particles move from 
i shaded region to shaded 


region after a fixed time 


y y d 
y Z J y “i interval. The two areas 
xf ¥ 
we x Y 


are the same. 


aa KF y y yA a 
<—_<—_~<—__<— zj > xX 
~~ KA a a a 


figure 4.4.6 The vector field F(x, y) = xi — yj. A 


4.4 Divergence and Curl 249 


Curl 


To calculate the curl, the second basic operation performed on vector fields, we take the 
cross product of V with F. 


Curl of a Vector Field If F = Fii+ F2j + F3k, the curl of F is the vector 


field 
ij k 
curl F =V x F= gone os 
ax ay az 
Fi Fo Fa 


OF;  dF2\, (dF, aF3\, (dF. dF 
= (3 -i+ 1 3 (ee 2 1\k 
əy az əz əx əx ay 


If we write F = P i+ Qj + Rk, which is alternative notation, the same formula for 
the curl reads 


curl F = 


OL|om 
PROF 


i 
= 
ox 
P 
-(< =) (= DRE ~)k 
oy OZ OX ðZ Ox oy 


example 7 | LetF(x, y,z) =xi+xyj+k Find V x F. 


solution | We use the preceding formula: 
i j k 
ð ə ə 4 : 
VxF= ax By 32 = (0 — 0)i — (0 — 0)j + ( y — 0)k. 
x xy 1 


Thus, VxF=yk A 


example 8 | Find thecurl of xyi — sin zj +k 


solution Letting F = xyi — sin zj + k, 


i j k 
varel = 
ax ay az 
xy —sinz 1 
ð 0 ð ə ð a 
=| əy azji—jax az|j+ |ex ay İk 
—sinz 1 xy 1 xy —sinz 


coszi-xk A 
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example 9 


Unlike the divergence, which can be defined in R” for any n, we define the curl only 
in three-dimensional space (or for planar vector fields, regarding their third component 
as zero). 


The Curl and Rotations 


The physical significance of the curl will be discussed in Chapter 8, when we study 
Stokes’ theorem. H owever, we can now consider a specific situation, in which the curl 
is associated with rotations. 


Consider asolid rigid body B rotating about an axis L . The rotational motion of the body 
can be described by a vector œ along the axis of rotation, the direction being chosen so 
that the body rotates about œ, as in Figure 4.4.7. We call œ the angular velocity vector. 
Thelength w = ||@|| is taken to be the angular speed of the body B ; that is, the speed of 
any pointin B divided by its distance from the axis L of rotation. The motion of points in 
the rotating body is described by the vector field wwhose value at each point is the veloc- 
ity at that point. To find v, let Q be any pointin B and let œ be the distance from Q to L. 

Figure 4.4.7 shows that œ = ||r|| sin 6, where ris the vector whose initial point is 
the origin and whose terminal point is Q and 8 is the angle between rand the axis L of 
rotation. The tangential velocity vof Q is directed counterclockwise along the tangent 
to a circle parallel to the xy plane with radius œ and has magnitude 


Ivi = oa = orl] sin 8 = |lal||iri sin 6. 


The direction and magnitude of vimply that v= œ x r. Selecting a coordinate system 
in which L isthe z axis, we can write w = wkand r= xi + yj + zk. Thus, 


V=@xr=—oyi+ oxj, 
and so 
i j k 
curl v ee ae! = 20k= 2w 
ox dy dz 
—wy wx 0 


Hence, for the rotation of a rigid body, the curl of the velocity vector field is a vector 
field whose value is the same at each point. It is directed along the axis of rotation with 
magnitude twice the angular speed. 


figure 4.4.7 The velocity v and angular velocity 
æ of a rotating body are related by V = w x r. 


figure 4.4.8 Looking ata 
paddle wheel from above a 
moving fluid. The velocity field 
Vox, y, D = (yi — x)/O? + y’) is 
irrotational; the paddle wheel 
does not rotate around its axis w. 


example 10 


solution 
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The Curl and Rotational Flow 


If a vector field represents the flow of a fluid, then the value of V x F at a point is 
twice the angular velocity vector of a rigid body that rotates as the fluid does near that 
point. In particular, V x F = Oat a point P means that the fluid is free from rigid 
rotations at P; that is, it has no whirlpools. Another justification of this idea depends 
on Stokes’ theorem from Chapter 8. However, we can say informally that curl F = O 
means that if a small rigid paddle wheel is placed in the fluid, it will move with the 
fluid but will not rotate around its own axis. Such a vector field is called irrotational. 
For example, it has been determined from experiments that fluid draining from a tub is 
usually irrotational except right at the center, even though the fluid is “rotating” around 
the drain (see Figure 4.4.8). In Example 10, the flow lines of the vector field V are circles 
about the origin, yet we show that the flow is irrotational. Thus, the reader should be 
warned of the possible confusion the word “irrotational” can cause. 


Verify that the vector field 


is irrotational when (x, y) 4 (0, 0) (i.e, except where V is not defined). 


The curl is 
i j k 
ð ð 0 
VxV=| ax ay az 
y —X 


x? + y? x24 y? 


oan n Taf x ð y 
=01+0+| > (ara) nites ls 


| [=x + y?) + 2x? n —(x? + y?) + 2y? 
| Se yA? (x2 + y?)? 


|k=0 i 
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example 11 


solution 


Gradients Are Curl Free 


The following identity is a basic relation between the gradient and curl, which should 
be compared with the fact that for any vector v, we have vx v = Q 


Theorem 1 Curl of a Gradient For any C? function f, 


¥ x (Wi) =@ 


That is, the curl of any gradient is the zero vector. 


proof Because Vf = (df /əx, df /ay, af /dz) we have, by definition, 
ij k 
ð 0 ə 
Vx Vf=Jax ay əz 
of of əf 

ax ay az 


( af ə?f )i ( əf af )i ( af af ) 
i+ j+ k. 
ayaz  azay əzə% ax az ax ay ay ax 


Each component is zero because of the equality of mixed partial derivatives. E 


Theconverse to this theorem (a vector field with zero curl isa gradient, under suitable 
hypotheses) will be discussed in Chapter 8. 


Let V(x, y, z) = yi— xj. Show that Vis not a gradient field. 


If V were a gradient field, then it would satisfy curl W = Oby Theorem 1. But 


ij k 
o ð g 
y —x 0 


so V cannot be a gradient. A 


Scalar Curl 


There is an operation on vector fields in the plane that is closely related to the curl. 
If F = P(x, y)i+ Q(x, y)j is a vector field in the plane, it can also be regarded as a 
vector field in space for which the k component is zero and the other two components 


example 12 


solution 


example 13 


solution 


4.4 Divergence and Curl 253 


are independent of z. The curl of F then reduces to 


and always points in the kdirection. The function 


aQ.saP 


ax day 
of x and y is called the scalar curl of F. 


Find the scalar curl of V(x, y) = —y*i+ xj. 


The curl is 


so the scalar curl, whichis the coefficient of K, is 1 +2y. A 


Curls Are Divergence Free 


A basic relation between the divergence and curl operations is given next. 


Theorem 2 Divergence of a Curl For any C ? vector field F, 
div curl F=V-(V x F) =0. 


Thatis, the divergence of any curl is zero. 


As with the curl of a gradient, the proof rests on the equality of the mixed partial 
derivatives. You should write out the details. A converse will be discussed in Chapter 8. 


Show that the vector field V(x, y, z) = xi + yj + zk cannot be the curl of some vector 
field F; that is, there is no F with V = curl F. 
If this were so, then div V would be zero by Theorem 2. But 


yy OX OY AZ 
ON ag op a 


so V cannot be curl FforanyF. A 
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example 14 


solution 


Laplacian 


The Laplace operator V7, which operates on functions f , is defined to be the divergence 
of the gradient: 


_ of P əf E a°f 
= əx? ay əz? 


v?f=V.(Vf) 


This operator plays an important role in many physical laws, as we have mentioned in 
Section 3.1. 
The next example is important in mathematical physics. 


Show that V? f = 0 for 


1 1 


VE+yY+Z r 


where r = xi+ yj + zkandr = |r|. 


f(x, y,z) = and (x,y,z) Æ (0, 0, 0), 


The first derivatives are 


of —X of -y af -=z 


əx (x?-+y2+22)3/2" əy | (x2@+y2422)3/2" əz (x? +y? +z?) 


Computing the second derivatives, we find that 


əf 7 3x2 1 
Ox2 KEYZ) [X2 + y? +. 22)3/2" 
af 3y? 1 
3y? (x2 + y2+.22)5/2 (X2 +y? + 22)372" 
ə f B 372 1 
az2 (x2 +y2472)52 (x2 + y2+422)32° 
Thus, 
f f əf E 3(x? + y? + z?) 3 
3x2 3y? əz? a (x2 + y2 + 22)5/2 (x2 + y? + 22)3/2 


3 3 si 
(x2 + y2 +z?) (x24 y2422)3/2 C A 


Vector Identities 


We now have these basic operations on hand: gradient, divergence, curl, and the L aplace 
operator. The following box contains some basic general formulas that are useful when 
computing with vector fields. 


example 15 


solution 
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Basic Identities of Vector Analysis 

1. Vif +9) =Vf+Vg 

2. V(cf) =cVf, for a constant c 

3. V(fg) = fVg+gvet 

4. V(f/g) =(gVf — f Vg)/g?, at points xwhere g(x) 4 0 
5. div (F + G) = div F + div G 

6. curl (F + G) = curl F + curl G 

7. div (f F) = fdivF+F-Vf 

8. div (F x G) = G- curl F — F - curl G 

9. div curl F = 0 

10. curl (f F) = fcurlF+VfxF 

ii, Guill Wi = 0 

12, Veta =f vo oyi ATi o vA) 


13. div (Vf x Vg) = 0 


14. div (f Vg —gvf) = f V?g — gV? f 


Prove identity 7 in the preceding box. 


The vector field f F has components f F;, fori = 1, 2, 3, and so 


div ( f F) = (fF) + ŠUT Fa) + SUFF). 


a 
ax ay 


However, (3/3x)( f Fi) = fdF1/ax + Fi0f/dx by the product rule, with similar ex- 
pressions for the other terms. Therefore, 


div (f F) = (= OF a ae) the af af 


F F 
OX oy ðZ Ta 


ax ay az 


Let us use these identities to redo Example 14. 
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example 16 | Show thatforr4 Q V?(1/r) = 0. 


solution As in the case of the gravitational potential, V(1/r) = —r/r?. In general, V(r") = 
nr ”=2r (see Exercise 38). By the identity V-(f F) = fV-F+ Vf -F, weget 


r 1 1 
V-(—)=3V-ré¢r-V( = 
r3 r r3 


3 
3 —3r 3 3 
=a il 75 =3 pu) A 


Divergence and Curl 


William Rowan Hamilton, in his investigation of quaternions (discussed in 
Histori c al N ote Section 1.3), introduced the del operator, defined formally as 
RA TEA TE 
~ ax ay? az 


Hamilton firmly believed in the significance of this operator. If f(x, y, z) isa 
scalar function on R, then “multiplication” by V gives the gradient of f: 


which, of course, gives the direction of steepest ascent (see Section 2.6). If 
V(x yZ) = V(x yi + V(x yZ + V(x yzk 
is a vector field, then the “quaternionic multiplication” of V with V yields 
VV = -div V + curl V. 


Thus, what we now call the divergence of V is the negative of the scalar part of 
this product, and curl V is the vector part (c.f. the quaternion discussion in 
Section 1.3). 

As far as we are aware, Hamilton never gave a physical interpretation of 
divergence and curl, but he surely believed that, as a consequence of his faith in 
them, they must have an important physical interpretation. His faith in his 
mathematical formalism was justified, but a physical explanation of divergence 
and curl had to wait for James Clerk Maxwell’s Treatise on Electricity and 
Magnetism. Here, Maxwell used both the divergence and the curl in his 
equations for the interaction of electric and magnetic fields (the Maxwell 
equations are discussed in Chapter 8). 

Curiously, Maxwell referred to divergence as convergence and to curl as 
rotation, a term still used in scientific research. It was Josiah Gibbs (Figure 4.4.9) 
who renamed convergence and rotation as the more familiar terms we use 
today—divergence and curl. 

Maxwell gave a physical interpretation of the divergence using the Gauss 
divergence theorem, as we do in Section 8.4. His physical interpretation of the 
curl as a rotation was rather brief. Gibbs provided a more elementary 
interpretation of divergence, as we do in this section. In the spirit of Leibniz (who 
believed in infinitesimal quantities dx,dy,dz), Gibbs imagined placing a small 
figure 4.4.9 Josiah Willard Gibbs cube of dimensions dx by dy by dzin a fluid. The faces of this cube have areas 
(1839-1903). dx dy, dy dz, and dx az. 
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At this point, you may be interested to hear Gibbs through the words of his student 


E. B. Wilson: 


Consider the amount of fluid which passes through those faces of the cube 
which are parallel to the yz plane, i.e., perpendicular to the x axis (see 
Figure 4.4.10). 

The normal to the face whose x coordinate is the lesser, that is, the 
normal to the left-hand face of the cube is —i. The flux of substance through 
this face is 


—i- V(x, y, z) dy az. 


baat 


—i dy dz i dy dz 


> X 


(x, y, 2) (x+dx, y, 2) 


figure 4.4.10 Cube with faces 
parallel to the yz plane. 


The normal to the opposite face, the face whose x coordinate is greater 
by the amount ax, is +i, and the flux through it is therefore 


av 
i- V(x+ OX, y, z2) dy dz = i. | V(x, y,z)+ TF dx| dy dz 


= İ. V(x, y, z) dy dz+ i. = dx dy az. 


The total flux outward from the cube through these two faces is therefore 
the algebraic sum of these quantities. This is simply 


aV avy 
i- ax dx dy az = = dx dy az. 
In like manner the fluxes through the other pairs of faces of the cube are 
. ov av 
j- iy dx dy dz and k. E dx dy dz. 


The total flux out from the cube is therefore 
i av av 

ay az 

This is the net quantity of fluid that leaves the cube per unit time. The 


quotient of this by the volume dx dy az of the cube gives the rate of 
diminution of density. This is 


av 


k 
ax + 


+j 


) dx dy az. 


aV 
a E i ale 
= ay 


aV ay, 
dz ax 


3V2 
əy 


3V3 
əz ` 
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exercises 


Because V - V thus represents the diminution of density or the rate at which 
matter is leaving a point per unit volume per unit time, it is called the 
divergence. Maxwell employed the term convergence to denote the rate 
at which fluid approaches a point per unit volume per unit time. This is the 
negative of the divergence. In the case that the fluid is incompressible, as 
much matter must leave the cube as enters it. The total change of contents 
must therefore be zero. For this reason, the characteristic differential 
equation that any incompressible fluid must satisfy is 


V-V=0, 


where V is the velocity of the fluid. This equation is often known as the 
hydrodynamic equation. It is satisfied by any flow of water, since water is 
practically incompressible. The great importance of the equation for work 
in electricity is due to the fact that according to Maxwell’s hypothesis, 
electric displacement obeys the same laws as an incompressible fluid. If, 
then, D is the electric displacement, 


divD=V-D=0. 


Gibbs’ interpretation of curl was much like the one we gave in Example 9 for the 
rotation of a rigid body. Wilson remarks that an analysis of the meaning of curl for 
fluid motion was “rather difficult.” It remains a bit elusive, even today, as can be 
seen from our discussion following Example 9. We provide another interpretation 
in Chapter 8. 


Find the divergence of the vector fields in Exercises 1 to 4. 


L V(x, y, z) = ei — ej + ek 

2 V(x, y, z) = yzi+ xzj + xyk 

3. V(x, y, z) = xi+(y+cosx)j+(z+e%)k 
4 Vx, y, z) = x?i+ (x + y)?j +(x +y+z)?k 


5. Figure 4.4.11 shows some flow lines and moving regions = 
for a fluid moving in the plane-field velocity field V. 5 


>< 


6. Let V(x, y, z) = xibe the velocity field of a fluid in 


Where is div V > 0, and also where is div V < 0? 


space. Relate the sign of the divergence with the rate of 
change of volume under the flow. 


» Sketch a few flow lines for F(x, y) = yi. Calculate V - F 


and explain why your answer is consistent with your 
sketch. 


8. Sketch a few flow lines for F(x, y) = —3xi — yj. 


Calculate V - F and explain why your answer is 
consistent with your sketch. 


figure 4.4.11 The flow lines of a fluid moving in the plane. 
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Calculate the divergence of the vector fields in Exercises 9 to 12. 


9. F(x, y) = xi — x sin (xy)j LL F(x, y) = sin (xy)i — cos (x?y)j 
10. F(x, y) = yi- xj 12. F(x, y)=xei—[y/(x+y)]j 
Compute the curl, V x F, of the vector fields in Exercises 13 to 16. 
13. F(x, y,z) =xi+yj+zk 15. F(x, y, z) = (x? + y? + z2?) (3i + 4j + 5k) 
14. F(x, y, z) = yzīi+ xzj + xyk 16. F(x, pjs o 
x? +y? +z 
Calculate the scalar curl of each of the vector fields in Exercises 17 to 20. 
17. F(x, y) = sin xi + cos xj 25. Suppose F: R? —> R? is aC ? vector field. Which of the 
following expressions are meaningful, and which are 
18. F(x, y) = yi- xj nonsense? For those which are meaningful, decide 
whether the expression defines a scalar function or a 
19. F(x, y) = xyi + (x? — y2)j vector field. 
20. F(x, y)=xi+ yj (a) curl(grad F) 
(b) grad(curl F)) 
2L Let F(x, y, z) = (X2, x2y,z+2x). (c) div(grad F) 
(a) Verify that V . (V x F) = 0. (d) grad(div F) 
(b) Can there exist a function f: R3? —> R such that (e) curl (div F) 
F = Vf? Explain. (f) div(curl F) 
22. (a) Which of the vector fields in Exercises 13-16 could 26. Suppose f, g, h: R — R are differentiable. Show that 
be gradient fields? the vector field F(x, y, z) = (f(x), g(y), h(z)) is 
(b) Which of the vector fields in Exercises 9-12 could irrotational. 


be the curl of some vector field V: R? > R?? 
27. Suppose f, g, h: R? > R are differentiable. Show that 
23. Let F(x, y, z) = (e%2, sin(xy), x°y3z2), the vector field F(x, y, z) = (fly, z), g(x, z), h(x, y)) 
: f has zero divergence. 
(a) Find the divergence of F. 


(b) Find the curl of F. 28. Prove identity 13 in the list of vector identities. 


24. Suppose f: R? — R is aC ? scalar function. Which of 
the following expressions are meaningful, and which are 
nonsense? For those which are meaningful, decide 
whether the expression defines a scalar function or a 


vector field. 

(a) curl(grad f ) (d) grad(div f) 
(b) grad(curl f)) (e) curl (div f) 
(c) div(grad f) (f) div(curl f) 


Verify that V x (V f ) = Ofor the functions in Exercises 29 to 32. 


29. f(x,y,z) = x2 +y2422 32 f(x, y, z) =x?y? + y?z? 


33. Show that F = y(cosx)i+ x(sin y)j is not a gradient 


30. f(x,y,z) =xy+yz+xz vector field. 


BL f(x,y,z) = 1/(x? + y? +27) 34. Show that F = (x? + y?)i — 2xyj is not a gradient field. 
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35. Prove identity 10 in the list of vector identities. 


36. Suppose that V - F = 0 and V- G = 0. Which of the 
following necessarily have zero divergence? 
(a) F+G 
(b) Fx G 


37. Let F = 2xz2i+j+ y?zxkand f = x?y. Compute the 
following quantities. 
(a) Vf 
(b) V xF 
(c) Fx Vf 
(d) F-(Vf) 


38. Let r(x, y, z) = (x, y, z) and 
r = /x2+y2 +z? = |[rl|. Prove the following 
identities. 
(a) V(1/r) = —r/r3,r + 0; and, in general, 
V(r") =nr-2rand V(logr) = r/r?. 
(b) V2(1/r) =0,r 40; and, in general, 
Vrn = n(n +1) r". 


(c) V-(r/r3) = 0; and, in general, 
V-(r%r) =(n+3)r", 


(d) V x r= Q and, in general, V x (r"r) = Q 
39. Does V x F have to be perpendicular to F? 


40. Let F(x, y, z) = 3x?yi + (x3 + y3)j. 
(a) Verify that curl F = Q 


(b) Find afunction f such that F = V f . (Techniques 
for constructing f in general are given in Chapter 8. 
The one in this problem should be sought by trial 
and error.) 


41. Show that the real and imaginary parts of each of the 
following complex functions form the components of an 
irrotational and incompressible vector field in the plane; 
herei = /—1. 

(a) (x= iy)? 
(b) (x = iy)? 
(c) eX-l¥ = eX(cosy — i sin y) 


review exercises for chapter 4 


For Exercises 1 to 4, at the indicated point, compute the velocity vector, the acceleration vector, the speed, and the equation of 
the tangent line. 


L dt) = (t? +1, et, cos (xt/2)), att = 1 
2 dt) = (t? — 1, cos (t2), t4), att = /7 


3. dt) = (e, sint, cost), att = 0 


t2 

4 dt) = —si+tj tt = 

dt) ie +tj+ka 2 

5. Calculate the tangent and acceleration vectors for the 
helix qt) = (cost, sint, t) att = 7/4. 


6. Calculate the tangent and acceleration vector for the 
cycloid dt) = (t—sint, 1—cost) att = 7/4 and sketch. 


7. Leta particle of mass m move on the path 
dt) = (t?, sint, cost). Compute the force acting on the 
particle att = 0. 


8. (a) Lett) bea path with |/¢t) || = constant; that is, the 
curve lies on a sphere. Show that ¢(t) is orthogonal 
to dt). 


(b) Let cbe a path whose speed is never zero. Show 
that c has constant speed if and only if the 
acceleration vector c” is always perpendicular to the 
velocity vector €. 


9. Let dt) = (cost, sint, /3t) be a path in R3. 


(a) Find the velocity and acceleration of this path. 


(b) Find a parametrization for the tangent line to this 
path att = 0. 


(c) Find the arc length of this path for t € [0, 27]. 


10. Let F(x, y, z) = (sin(xz), e¥Y, x2y3z9). 


(a) Find the divergence of F. 
(b) Find the curl of F. 


IL Verify that the gravitational force field 


X, Y, Z 
F(x, y,z) =—A 
(x, y, 2) (x2 + y? +22)372 where A issome 


constant, is curl free away from the origin. 


12. Show that the vector field V(x, y, z) = 2xi— 3yj + 4zk 
is not the curl of any vector field. 


13. Express the arc length of the curve x? = y? = z* 
between x = 1 and x = 4 as an integral, using a suitable 
parametrization. 


14. Find the arc length of dt) = ti+ (logt)j + 2/2tk for 
1<t <2. 


15. A particle is constrained to move around the unit circle 
in the xy plane according to the formula 
(x, y, Z) = (cos(t?), sin (t?), 0), t > 0. 


(a) What are the velocity vector and speed of the 
particle as functions of t? 


(b) Atwhat point on the circle should the particle be 
released to hit a target at (2, 0, 0)? (Be careful about 
which direction the particle is moving around the 
circle.) 


At what time t should the release take place? (Use 
the smallest t > 0 that will work.) 


W hat are the velocity and speed at the time of 
release? 


— 
O 


(d 


Compute V - Fand V x F for the vector fields in Exercises 21 to 24. 


2L F = 2xi+3yj+ 4zk 


22 F=xi+y?j+z?k 


25. F(x, y, z) = yi + zj + xk, at the point (1, 1, 1) 


27. f(x, y) =e% + cos(xy) 
x2 y? 

28. f(x,y) = ar 

29. f(x, y) = e% — cos (xy?) 

30. f(x, y) = tan! (x? + y?) 


3L (a) Let f(x, y, z) = xyz2; compute Vf. 
(b) Let F(x, y, z) = xyi + yzj + zyk; compute V x F. 


(c) Compute V x (f F) using identity 10 of the list of 
vector identities. Compare with a direct computation. 


32. (a) Let F= 2xye2i+ e7x?j + (x2ye? + z?)k. Compute 
V-FandV xF. 
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(e) At what time is the target hit? 


16. A particle of mass m moves under the influence of a 
force F = —kr, where k is a constant and r(t) is the 
position of the particle at time t. 


(a) Write down differential equations for the 
components of r(t). 


(b) Solve the equations in part (a) subject to the initial 
conditions r(0) = O r(0) = 2j + k. 


17. Write the curve described by the equations 
x—1= 2y +1 = 3z +2 in parametric form. 


18. Write the curve x = y? = z? + 1 in parametric form. 

19. Show that qt) = (1/(1 — t), 0, e' /(1 — t)) is a flow line 
of the vector field defined by F(x, y, z) = (x2, 0, 
z(1+x)). 


20. Let F(x, y) = f (x? + y2)[—yi+ xj] for a function f of 
one variable. W hat equation must g(t) satisfy for 


dt) = [cosg(t)]i+ [sin g(t) ]j 


to be a flow line for F? 


23. F=(x+y)i¢(y+z)j+(z+x)k 


24. F = xi+ 3xyj+zk 


Compute the divergence and curl of the vector fields in Exercises 25 and 26 at the points indicated. 


26. F(x, y,z) = (x + y)3i+ (sinxy)j + (cos xyz)k, at the 
point (2, 0, 1) 


Calculate the gradients of the functions in Exercises 27 to 30, and verify that V x Vf =O 


(b) Find afunction f(x, y, z) such that F = Vf. 


33. Let F(x, y) = f (x? + y2)[—yi+ xj], as in Exercise 20. 
Calculate div F and curl F and discuss your answers in 
view of the results of Exercise 20. 


34. Leta particle of mass m move along the elliptical helix 
qt) =(4 cost, sint, t). 


(a) Find the equation of the tangent line to the helix at 
t = x /4. 
(b) Find the force acting on the particle at timet = 7/4. 


(c) Write an expression (in terms of an integral) for the 
arc length of the curve et) between t = 0 and 
t=7/4. 
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35. (a) Let g(x, y, z) =x? + 5yz + z? and let h(u) bea 


function of one variable such that h’(1) = 1/2. Let 
f =hog. Starting at (1, 0, 0), in what directions is 
f changing at 50% of its maximum rate? 

(b) For g(x, y, z) =x? + 5yz + z2, calculate F = Vg, 
the gradient of g, and verify directly that V x F = 0 
at each point (x, y, Z). 


(a) Write in parametric form the curve that is the 
intersection of the surfaces x? + y? + z2? = 3 and 
y=1. 

(b) Find the equation of the line tangent to this curve at 
(1,1, 1). 

(c) Write an integral expression for the arc length of 
this curve. W hat is the value of this integral? 


. In meteorology, the negative pressure gradient G is a 
vector quantity that points from regions of high pressure 
to regions of low pressure, normal to the lines of 
constant pressure (isobars). 


(a) In an xy coordinate system, 


W rite a formula for the magnitude of the negative 
pressure gradient. 


(b) If the horizontal pressure gradient provided the only 
horizontal force acting on the air, the wind would 
blow directly across the isobars in the direction of 
G, and for a given air mass, with acceleration 
proportional to the magnitude of G. Explain, using 
Newton's second law. 


(c) Because of the rotation of the earth, the wind does 
not blow in the direction that part 
(b) would suggest. Instead, it obeys Buys—Ballot’s 
law, which states: “If in the Northern Hemisphere, 
you stand with your back to the wind, the high 
pressure is on your right and the low pressure is on 
your left.” Draw a figure and introduce xy 
coordinates so that G points in the proper direction. 


(d) State and graphically illustrate B uys-B allot’s law for 
the Southern Hemisphere, in which the orientation 
of high and low pressure is reversed. 


A sphere of mass m, radius a, and uniform density has 
potential u and gravitational force F, at a distancer from 
the center (0, 0, 0), given by 


3m mr? m 
U= 2-5, F=-—r (r <a); 
2a 2a a3 <a) 
va” 
7 i r3 


Here, r = |r|, r= xi+ yj + zk. 


(a) Verify that F = Vu on the inside and outside of the 
sphere. 


(b) Check that u satisfies Poisson’s equation: 
d2u/ax? + d2u/ay? + a2u/az? = constant inside 
the sphere. 

(c) Show that u satisfies Laplace's equation: 
a2u/ax? + a2u/ay2 + d2u/az2 = 0 outside the 
sphere. 


A circular helix that lies on the cylinder x2 + y? = R? 
with pitch o may be described parametrically by 

x = Rosé, y = Rsin9, Z = 0, 0 >0. 
A particle slides under the action of gravity (which acts 

parallel to the z axis) without friction along the helix. 

If the particle starts out at the height zo > 0, then when 

it reaches the height z along the helix, its speed is 

given by 


d 
== ty — 7)29, 


where s is arc length along the helix, g is the constant of 
gravity, t is time, and 0 < Z < Zọ. 


(a) Find the length of the part of the helix between the 
planes z = zọ and Z = 21,0 < Z1 < Zọ. 

(b) Compute the time To it takes the particle to reach the 
plane z = 0. 


A sphere of radius 10 centimeters (cm) with center at 
(0, 0, 0) rotates about the z axis with angular velocity 4 
in such a direction that the rotation looks 
counterclockwise from the positive z axis. 


(a) Find the rotation vector w (see Example 9, in 
Section 4.4). 

(b) Find the velocity v= œ x rwhen r = 5./2(i— j) is 
on the “equator.” 

(c) Find the velocity of the point (0, 5/3, 5) on the 
sphere. 


Find the speed of the students in a classroom located at a 
latitude 49°N due to the rotation of the earth. (Ignore the 
motion of the earth about the sun, the sun in the galaxy, 
etc.; the radius of the earth is 3960 miles.) 


5 


Double and Iriple Integrals 


5.1 Introduction 


It is to Archimedes himself (c. 225 B.C.) that we owe the nearest approach to actual inte- 
gration to be found among the Greeks. His first noteworthy advance in this direction was 
concerned with his proof that the area of a parabolic segment is four thirds of the triangle 
with the same base and vertex, or two thirds of the circumscribed parallelogram. 


—D. E. Smith, 


History of Mathematics 


In this chapter and the next we study the integration of real-valued 
functions of several variables; this chapter treats integrals of functions 
of two and three variables, or double and triple integrals. Ihe double 
integral has a basic geometric interpretation as volume, and can be 
defined rigorously as a limit of approximating sums. We shall present sev- 
eral techniques for evaluating double and triple integrals and consider 


some applications. 


This section discusses some geometric aspects of the double integral, deferring amore 
rigorous discussion in terms of Riemann sums until Section 5.2. 


Double Integrals as Volumes 


Consider a continuous function of two variables f : R c R? —> R whose domain R isa 
rectangle with sides parallel to the coordinate axes. The rectangle R can be described in 
terms of the two closed intervals [a, b] and [c, d], representing the sides of R along the 
x and y axes, respectively, as in Figure 5.1.1. In this case, we say that R is the Cartesian 
product of [a, b] and [c, d] and write R = [a, b] x [c, d]. 

Assume that f(x,y) > 0 on R, so that the graph of z = f(x, y) is a surface 
lying above the rectangle R. This surface, the rectangle R, and the four planes x = a, 
x =b, y = c, and y = d form the boundary of a region V in space (see Figure 5.1.1). 

The problem of how to rigorously define the volume of V has to be faced, and we 
shall solve it in Section 5.2 by the classic method of exhaustion, or rather, in more 
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example | 


Graph of 
z=f y) 


yee figure 5.1.1 The region V in space is 


1 bounded by the graph of f, the rectangle 
g Region V 


R, and the four vertical sides indicated. 


modern terms, the method of Riemann sums. To gain an intuitive grasp of the double 
integral, we provisionally assume that the volume of a region has been defined. 


Double Integrals The volume of the region above R and under the graph of a 
nonnegative function f is called the (double) integral of f over R and is denoted 


by 
ff tya, or f| toyida. 


(a) If f is defined by f(x, y) = k, where k is a positive constant, then 


1 f(x, y) dA =k(b —a)(d =€), 


because the integral is equal to the volume of a rectangular box with base R and heightk. 
(b) If f(x,y) =1—x and R = [0,1] x [0, 1], then 


II Hx yida = 5, 


because the integral is equal to the volume of the triangular solid shown in Figure 5.1.2. 


figure 5.1.2 Volume under the graph z= 1 — x 
and over R = (0, 1) x (0, 1). 


example 2 


YA 


a b 


figure 5.1.4 Area under the 
graph of a nonnegative 
continuous function f from x = a 
tox=bis |? fx dx. 


figure 5.1.5 The sum of the 
areas of the shaded rectangles is 
a Riemann sum, which 
approximates the area under f 
from x=atox=b. 
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Suppose z = f(x,y) =x*+y? and R = [-1,1] x [0,1]. Then the integral 
Jfn(x? + y?) dxdy is equal to the volume of the solid sketched in Figure 5.1.3. We 
shall compute this integral in Example 3. 


z 


z=f@, y)=x +y 


figure 5.1.3 Volume under z = x* + y? and over 
R= (—1, 1) x (0, 1). 


A 


These ideas are similar to those for a single integral J? f (x) dx, which represents 
the area under the graph of f if f > 0; see Figure 5.1.4.1 

Single integrals J? f (x) dx can be rigorously defined, without recourse to the area 
concept, as a limit of Riemann sums. Theideais to approximate f? f (x) dx by choosing 
a partition a = Xọ < X1 < :-- < Xn = b of [a, b], selecting points c; € [Xi, X41], and 
forming the Riemann sum 


n—1 
f (ci) (Xia. — Xi) © f (x) dx 
Jy flea) f 


(see Figure 5.1.5). We examine the analogous process for double integrals in the next 
section. 


y 


a? xu aq x à x C) Hog, My FB 


Cavalieri’s Principle 


There is a useful method for computing volumes, known as Cavalieri’s principle. Sup- 
pose we have a solid body and we let A(x) denote its cross-sectional area in a plane P, 
measured at a distance x from a reference plane (Figure 5.1.6). 


1R eaders not already familiar with this idea should review the appropriate sections of their introductory 
calculus text. 
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figure 5.1.6 A solid body with 
cross-sectional area A(x) at 
distance x from a reference 
plane. 


figure 5.1.7 Volume of a slab 
with cross-sectional area A(x) 
and thickness Ax equals A(x) Ax. 
The total volume of the body is 
f? Aco ax. 


Historical Note 


A(x): area of 
cross section 


Reference plane 


According to Cavalieri’s principle, the volume of the body is given by 
b 
volume = i A(x) dx, 
a 


where a and b are the minimum and maximum distances from the reference plane. 
This can be made intuitively clear as follows. If we partition [a, b] into n-equal parts 
by taking a = Xp < Xı < --: < Xn = b, then if Ax = Xj41 — Xi an approximating 
Riemann sum for the preceding integral is 

n-1 n-1 
$7 A(Ci)(Xi42 X) = A Alc) AX. 
i=0 i=0 
But this sum also approximates the volume of the body, because A(x) Ax is the volume 
of a slab with cross-sectional area A(x) and thickness Ax (Figure 5.1.7). Therefore, itis 
reasonable to accept the preceding formula for the volume. A more careful justification 
of this method is given in the Internet supplement for Chapter 5. 


The Slice Method—Cavalieri’s Principle Let S be a solid and, for x 
satisfying a < x <b, let P, bea family of parallel planes such that: 


1. S lies between P, and Pp; 
2. The area of the slice of S cut by P, is A(x). 


[ A(x) dx. 


Bonaventura Cavalieri (1598-1647) was a pupil of Galileo and a professor in 
Bologna. His investigations into area and volume were important building blocks 
of the foundations of calculus. Although his methods were criticized by his 
contemporaries, similar ideas had been used by Archimedes in antiquity and 
were later taken up by the “fathers” of calculus, Newton and Leibniz. 


Then the volume of S is equal to 
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figure 5.1.8 Two different cross sections 
sweeping out the volume under 
Z= f(x,y). 


Reduction to Iterated Integrals 


We now use Cavalieri’s principle to evaluate double integrals. Consider the solid region 
under a graph z = f(x, y) defined on the region [a, b] x [c, d], where f is continuous 
and greater than zero. T here are two natural cross-sectional area functions: one obtained 
by using cutting planes perpendicular to the x axis, and the other obtained by using 
cutting planes perpendicular to the y axis. The cross section determined by a cutting 
plane x = Xo, of the first sort, is the plane region under the graph of z = f (Xo, y) from 
y =c to y = d (Figure 5.1.8). 

When we fix x = Xo, we obtain the function y > f (Xo, y), which is continuous on 
[c, d]. The cross-sectional area A (Xo) is, therefore, equal to the integral Ji f (Xo, y) dy. 
Thus, the n sectional area function A has domain [a, b], and is given by the rule 
Aix > rt (x, y) dy. By Cavalieri’s principle, the volume V of the region under 
z = f(x, y) must be equal to 


v = [noas f | f t yoyo 


Theintegral f? [so f (x, y) dy] dx is known as an iterated integral because it is obtained 


by ie rating with respect to y and then integrating the result with respect to x. Because 
Sfr f(x; y) dA is equal to the volume V , we get the following result. 


Double and Iterated Integrals 


ff renas f |f toner) ax (1) 


If we use cutting planes perpendicular to the y axis, we obtain 


[frena f [S tenaa (2) 


The expression on the right of formula (2) is the iterated integral obtained by 
integrating with respect to x and then integrating the result with respect to y. 
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Thus, if our intuition about volumes is correct, formulas (1) and (2) ought to be valid. 
This is in fact true when the concepts we are discussing are defined rigorously, and is 
known as F ubini’s theorem. We give a proof of this theorem in the next section. 

As the following examples illustrate, the notion of the iterated integral and equations 
(1) and (2) provide a powerful method for computing the double integral of a function 
of two variables. 


example 3 | Evaluate the integral 


Jf (x? + y°) dxdy, 


where R = [—1, 1] x [0, 1]. 


[[eryvroy= [ [fo vrai dy. 


To find i (x? +y?) dx, we treat y as a constant and integrate with respect to x. Because 
x3/3 + y2x is an antiderivative of x? + y? with respect to x, we can integrate, using the 
fundamental theorem of calculus, to obtain 


solution By equation (2), 


1 x3 1 2 
J (x2 + y?) dx = s+ = 3 t2y’. 


1 x=-1 


Next, we integrate 3 + 2y? with respect to y from 0 to 1 to obtain 


1/2 2 a A 
£ + 2y? =]|= Zy? =>. 
f (G + y) [y+ pl, 3 
Hence, the volume of the solid we saw in Figure 5.1.3 is 4/3. 


For completeness, let us evaluate ff (x? + y?) dxdy using equation (1)— that is, 
integrating with respect to y first and then with respect to x. We have 


[[ erna f [0+ ya ayo 


Treating x as a constant in the y integration, we obtain 


1 


1 y? 1 
[oetyoy= heyt] =x +5. 
0 3 y=0 3 


Next, we evaluate ft, (x? + }) dx to obtain 


Lepega 
Nou a oe | 


which agrees with our previous answer. A 


example 4 


exercises 


solution 
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Compute the double integral ff; cosx sin y dxdy, where S is the square [0, 2/2] x 
[0, 2/2] (see Figure 5.1.9). 


z= cos x sin y 


figure 5.1.9 Volume under z = cos xsin y 
and over the rectangle (0, 2/2) x (0, 7/2). 


By equation (2), 


x/2 x/2 
J [ c0sx siny dxdy = | p COS X siny x ay 
S 0 0 
a /2 a /2 x/2 
=i) siny| [ cosx ax] ay = f sin ydy = 1. i 
0 0 0 


Inthe nextsection, we shall useRiemann sumsto rigorously define the doubleintegral 
for a large class of functions of two variables without recourse to the notion of volume. 
Although we shall drop the requirement that f (x, y) > 0, equations (1) and (2) will 
remain valid. Therefore, the iterated integral will again provide the key to computing 
the double integral. In Section 5.3, we treat double integrals over regions more general 
than rectangles. 

Finally, we remark that it is common to delete the brackets in iterated integrals such 
as equations (1) and (2) and to write 


[ [tnay in place of [[ [tna dx 
f [ enaa in place of [| ene dy. 


and 


L Evaluate the following iterated integrals: 2 74 ry y 
(c) J | € + 2) dxdy 
i J2 


1 pl 
(a) I f (1 — x? + xy) dx dy 
0 Jo 


w/4 prn/4 
n/2 pr/2 (d) f | tan x sec? y dx dy 
(b) I J cos x sin y dx dy 0 0 
0 —n/2 
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2. Evaluate the integrals in Exercise 1 by integrating first 7. A lumberjack cuts out a wedge-shaped piece W of a 
with respect to y and then with respect to x. cylindrical tree of radius r obtained by making two saw 
cuts to the tree’s center, one horizontally and one at an 
3. Evaluate the following iterated integrals: angle 6. Compute the volume of the wedge W using 
l,l Cavalieri’s principle. (See Figure 5.1.12.) 
@ f f otysyiayas 
-1J0 


m/2 pl 

o f f byeosx +2) dyas 
o Jo 
1 pl 

(c) i f orea 
o Jo 
0 p2 

(d) J J i-xiogyidyax 
-1 J1 


4. Evaluate the integrals in Exercise 3 by integrating with 
respect to x and then with respect to y. [The solution to 
part (b) only is in the Study Guide to this text.] 


5. Use Cavalieri’s principle to show that the volumes of two 
cylinders with the same base and height are equal (see 
Figure 5.1.10). 


figure 5.1.12 Find the volume of W. 


8. (a) Show that the volume of the solid of revolution 
shown in Figure 5.1.13(a) is 


b 
a f [f(x)]? dx. 
a 


(b) Show that the volume of the region obtained by 
rotating the region under the graph of the parabola 
y = —x* + 2x + 3, —1 < x < 3, about the x axis is 
5127/15 [see Figure 5.1.13(b)]. 


figure 5.1.10 Two cylinders with the same base and 
height have the same volume. y y 


y= rx? 42x43 


l oa a y=f@ y) 
6. Using Cavalieri’s principle, compute the volume of the x 


structure shown in Figure 5.1.11; each cross section is a 
rectangle of length 5 and width 3. 


-w (a) (b) 


figure 5.1.13 The solid of revolution (a) has volume z fre f(x)? dx. 


Part (b) shows the region between the graph of y= —x*+2x+3 
and the x axis rotated about the x axis. 


figure 5.1.11 Compute this volume. 
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Evaluate the double integrals in Exercises 9 to 11, where R is the rectangle [0, 2] x [—1, 0]. 


13. Evaluate the iterated integral: 
9. J| eraa 9 
R 
1 pl ; 
1 
10. Jf (ivicos jx dy dx f | exe dxdy. 
R 4 o Jo 
id 14. Find the volume bounded by the graph of 
IL —xe* sin = 
[| ( cl 57y) Hii f(x, y) = 1+ 2x + 3y, the rectangle [1, 2] x [0, 1], 
and the four vertical sides of the rectangle R, as in 
I2. Evaluate the iterated integral: Figure 5.1.1. 
ay Xy dxd 15. Repeat Exercise 14 for the function f (x, y) = x* + y? 
1 (y) y: and the rectangle [—1, 1] x [—3, —2]. 


5.2 The Double Integral Over a Rectangle 


Weare ready to give a rigorous definition of the double integral as the limit of a sequence 
of sums. This will then be used to define the volume of the region under the graph of a 
function f(x, y). We shall not require that f(x, y) > 0; butif f(x, y) assumes negative 
values, we shall interpret the integral as a signed volume, just as for the area under the 
graph of afunction of one variable. In addition, we shall discuss some of the fundamental 
algebraic properties of the double integral and prove Fubini’s theorem, which states that 
the double integral can be calculated as an iterated integral. To begin, let us establish 
some notation for partitions and sums. 


Definition of the Integral 


Consider a closed rectangle R c R?; that is, R is a Cartesian product of two inter- 
vals: R = [a,b] x [c,d]. By a regular partition of R of order n we mean the two 
ordered collections of n + 1 equally spaced points {x)}'_» and {Yk}k-o; that is, the 
points satisfying 


a= Xp <X1 <--- <X, =D, C=Vo<YVi<-::-<yn=d 


and 


P para y E 
j+1 u= n! k+1 k= n 


(see Figure 5.2.1). 

A function f(x, y) is said to be bounded if there is a number M > 0 such that 
—M < f(x,y) < M forall (x, y) in the domain of f. A continuous function on a 
closed rectangle is always bounded, but, for example, f(x, y) = 1/x on (0, 1] x [0, 1] 
is continuous but is not bounded, because 1/x becomes arbitrarily large for x near 0. 
The rectangle (0, 1] x [0, 1] is not closed, because the endpoint 0 is missing in the first 
factor. 
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figure 5.2.1 A regular partition 
of a rectangle R, with n= 4. 


Let Rj, be the rectangle [X;, Xj+1] x [Yk Vera], and let ¢, be any point in R jx. 
Suppose f: R — Risa bounded real-valued function. Form the sum 


n—-1 n-1 
Si = XO fq) Ax Ay = $ flq) A (1) 
j,.k=0 j,k=0 
where 
b—a d—c 
de | aera i ee | oar 1 
and 


AA = AX Ay. 


This sum is taken over all j’s and k's from 0 to n — 1, and so there are n? terms. A sum 
of this type is called a Riemann sum for f. 


Definition Double Integral If the sequence {Sn} converges to a limit S as 
n — ooand if the limit S is the same for any choice of points G in the rectangles 
Rjk, then we say that f is integrable over R and we write 


Jf tyda, Jf tix yaxa, or J| taw 
R R R 


for the limit S. 


Thus, we can rewrite integrability in the following way: 


limit > f (Gx) Ax ay = ff f dx dy 


j,k=0 


for any choice of Gk € Rix. 


Properties of the Integral 


The proof of the following basic theorem is presented in the Internet supplement for 
Chapter 5. 


figure 5.2.2 The sum of inscribed 
boxes approximates the volume 
under the graph of z= f(x, y). 


figure 5.2.3 The volume of 
circumscribed boxes also 
approximates the volume under 
z= f(x,y). 
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Theorem | Any continuous function defined on a closed rectangle R is 
integrable. 


If f(x, y) > 0, the existence of limit,_... Sn has a straightforward geometric mean- 
ing. Consider the graph of z = f(x, y) as the top of a solid whose base is the rectangle 
R. If we take each q, to be a point where f(x, y) has its minimum value? on Rik: 
then es Ax Ay represents the volume of a rectangular box with base Rjx. The sum 
ae xo f (Gx) Ax Ay equals the volume of an inscribed solid, part of which is shown in 
Figure 5.2.2. 

T if Gk is a point where f(x, y) has its maximum on Rjx, then the sum 
De xo f (Gk) Ax Ay is equal to the volume of a circumscribed solid (see Figure 5.2.3). 


>N 


>N 


2Such G exist by virtue of the continuity of f on R; see Theorem 7 in Section 3.3. 
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figure 5.2.5 Curves in the plane 
represented as graphs. 


N 


A A 
Break in the 
j surface 
A “broken surface” 4 z=f (x, y) 
z= fl, y) 
l 
l = ell, 
1 
F 
pii ee al \ 
= | i 7 > y 


l 

] 

I l R 
7 

] 

] 


(7 


Set of discontinuities of f 
Set of discontinuities of f 


figure 5.2.4 What the graphs of discontinuous functions of two variables might look like. 


x: yO) 
x? y) 


x 


Therefore, if limito Sn exists and is independent of Gk € Rjx, it follows that the 
volumes of the inscribed and circumscribed solids approach the same limitasn — oo. It 
is therefore reasonable to call this limit the exact volume of the solid under the graph of 
f . Thus, the method of Riemann sums supports the concepts introduced on an intuitive 
basis in Section 5.1. 

There is a theorem guaranteeing the existence of the integral of certain discontinuous 
functions as well. We shall need this result in the next section in order to discuss the 
integrals of functions over regions more general than rectangles. We shall be specifically 
interested in functions whose discontinuities lie on curves in the xy plane. Figure 5.2.4 
shows two functions defined on a rectangle R whose discontinuities lie along curves. In 
other words, f is continuous at each point that is in R, but not necessarily on the curve. 

Useful curves are graphs of functions such as y = @(x),a < x <b,orx = w(y), 
c < y < d, or finite unions of such graphs. Some examples are shown in Figure 5.2.5. 

The next theorem provides an important criterion for determining whether a function 
is integrable. The proof is discussed in the Internet supplement. 


Theorem 2 Integrability of Bounded Functions Let f:R —> R be a 
bounded real-valued function on the rectangle R, and suppose that the set of 
points where f is discontinuous lies on a finite union of graphs of continuous 
functions. Then f is integrable over R. 
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Using Theorem 2 and the remarks preceding it, we see that the functions sketched in 
Figure 5.2.4 are integrable over R, because these functions are bounded and continuous 
except on graphs of continuous functions. 

From the definition of the integral as a limit of sums and the limit theorems, we can 
deduce some fundamental properties of the integral ffà f(x, y) dA; these properties 
are essentially the same as for the integral of a real-valued function of a single variable. 

Let f and g beintegrable functions on the rectangle R, and letc bea constant. Then 
f +g andcf areintegrable, and 


(i) Linearity 


[ [tty +90, yea = ff Fix, y)da+ ff g(x, y) dA 


(ii) Homogeneity 
If cf(x,y)da =c ff f(x, y) dA 


(iii) Monotonicity If f(x, y) > g(x, y), then 


II Flx,y)da > ff g(x, y) dA 


(iv) Additivity If Rj,i = 1,...,m, are pairwise disjoint rectangles such that f is 
bounded and integrable over each R; andif Q = RiU R2U-- -U Rm isa rectangle, 
then f: Q — R is integrable over Q and 


Jf rox yaa =y ff f(x, y) dA 


Properties (i) and (ii) are a consequence of the definition of the integral as a limit of 
a sum and the following facts for convergent sequences {Sn} and {Tn}, which are proved 
as with the limit theorems in Chapter 2: 


limit (Ta + Sn) = limitT, + limit Sn 
Noo 00 oo 
limit (CS) =C limit Sn. 
To demonstrate monotonicity, we first observe if h(x, y) > 0 and {Sn} isa 
sequence : Riemann sums that converges to {/, h(x, y) dA, then Sn > 0 for all n, so 


that Jinn X, i )dA = limito Sn > 0. If f(x, y) > g(x, y) for all (x, y) € R, then 
(f —g)(x, y) > 0 forall (x, y), and, using properties (i) and (ii), we have 


f| tya- f| 90.04 = Jf 1y) -olx ida > 0. 


This proves property (iii). The proof of property (iv) is more technical, and a special 
case is proved in the Internet supplement. It should be intuitively obvious. 
Another important result is the inequality 
J | If |dA. (2) 
R 


Jes 
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To see why formula (2) is true, note that, by the definition of absolute value, 
-ifI<f<Ifi; 


therefore, from the monotonicity and homogeneity of integration (with c = —1), 


- ff itio < ff tans [finan 


which is equivalent to formula (2). 


Fubini’s Theorem 


Although we have noted the integrability of a variety of functions, we have not yet 
established rigorously a general method of computing integrals. In the case of one 
variable, we avoid computing J? f (x) dx from its definition as a limit of a sum by using 
the fundamental theorem of integral calculus. This important theorem tells us that if f 
is continuous, then 


b 
J f(x) dx = F (b) — F (a), 


where F is an antiderivative of f; thatis, F’ = f. 

This technique will not work as stated for functions f(x, y) of two variables. H ow- 
ever, as we indicate in Section 5.1, we can often reduce a double integral over a rectangle 
to iterated single integrals; the fundamental theorem then applies to each of these sin- 
gle integrals. Fubini’s theorem, which was mentioned in the last section, establishes 
this reduction to iterated integrals rigorously, by using Riemann sums. As we saw in 
Section 5.1, the reduction, 


1 tyas f S Fox, y)dy| x= f [f tnad dy, 


is a consequence of Cavalieri’s principle, at least if f(x, y) > 0. In terms of Riemann 
sums, it corresponds to the following equality: 


n—1 n—1 /n-1 n—1 /n-1 
f (qx) Ax Ay = 0 (S f(g) s) ax=> ( (Gi) ax) Ay, 
j,k=0 j=0 \k=0 k=0 \j=0 


which may be proved more generally as follows: Let [ajk] be an n x n matrix, where 
0<j<n-—land0O<k<n-—1.Let beer ajk be the sum of the n2 matrix entries. 


Then 
n— n-l /n-1 n-1 /n-1 
ajk = (S an) = (£ an) $ (3) 
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In the first equality, the right-hand side represents summing the matrix entries first 
by rows and then adding the results: 


A(n-1)0 A(n—1)1 see Ael ot" Amni) 


n-1 n-1 
j=0 k=0 


Clearly, this is equal to Diko ajk; that is, the sum of all the ajk. Similarly, the sum Bie 
(oS ajk) represents a summing of the matrix entries by columns. This establishes 
equation (3) and makes the reduction to iterated integrals quite plausible if we remember 
that integrals can be approximated by the corresponding Riemann sums. The actual proof 
of Fubini’s theorem exploits this idea. 


Theorem 3 Fubini’s Theorem Let f bea continuous function with a rect- 
angular domain R = [a, b] x [c, d]. Then 


f [tengas f f te= ff t yoa. (4) 


proof We shall first show that 


[ f Fix, y)dyax= f f(x, y) dA. 


Letc = yo < Yı <--- < Yn = d be a partition of [c, d] into n equal parts. Define 


d 
Fix) = [ f(x, y) dy. 


Then 
n—1 


F(x) = a f (x, y) dy. 


k=0 “Y Yk 


Using the integral version of the mean-value theorem,? for each fixed x and for each k 
we have 


[6 yhdy = £1 Ye Ye = Wi 


Yk 


3T his states that if g(x) is continuous on [a, b], then f g(x) dx = g(c)(b—a) for some pointc e [a, b]. 
The more general second mean-value theorem was proved in Section 3.2. 
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figure 5.2.6 The notation 
needed in the proof of Fubini’s 
theorem: n= 8. 


(see Figure 5.2.6), where the point Y,(x) belongs to [ yx, Yk+1] and may depend on x, k, 
and n. 
We have thus shown that 


F(x) = f(X, Yk(X))( Yar — Yk). (5) 
k=0 


By the definition of the integral in one variable as a limit of Riemann sums, 


b b d n-1 
F(x)dx = | f(x, y) ay dx = limit X F (p;)(Xj41 — Xj), 
fros- fiy iit SF ax 


wherea = Xo < X1 < ++- < Xn = b isa partition of the interval [a, b] into n equal parts 
and pj isany pointin[x;, Xj+1]. Setting Gk = (pj, Yx(p;)) € Rjx, we have [substituting 
p; for x in equation (5)] 


n-1 


F( pj) =X F(GK)( Yara — Yi). 
k=0 
Therefore, 
b pd b n-1 
Il F(x, ylaydx =f F (x) dx = limit SF ( pj)(Xj41 — Xj) 
À r a a = 


rer 


n—-1 n-1 


= limit 5 f(Gk)( Yat — Yk) (Xj+1 — Xj) 
j=0 k 


=0 
2 I f(x, y) dA. 
R 
Thus, we have proved that 


a F(x, y) aydx = | f(x, y) dA. 


ll 
So 


example | 


solution 
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By the same reasoning we can show that 


[ [ enw- ff f(x, y) dA. 


These two conclusions are exactly what we wanted to prove. E 


Fubini’stheorem can be generalized to the casewhere f isnotnecessarily continuous. 


Although we shall not present a proof, we state here this more general version. 


Theorem 3) Fubini’s Theorem Let f bea bounded function with domain a 
rectangle R = [a, b] x [c, d], and suppose the discontinuities of f lie ona finite 
union of graphs of continuous functions. If the integral iN f (x, y) dy exists for 


each x € [a, b], then 
b d 
/ | f Fox, y) dy] dx 


iy F(x, y)dyax= ff f(x, y) dA. 


Similarly, if f? f(x, y) dx exists for each y e [c, d], then 


[ | [te nex 
[ [ enea ff f(x, y) dA. 


Thus, if all these conditions hold simultaneously, 


f f enuas f f tya ff f(x, y) dA. 


exists and 


exists and 


The assumptions made for this version of Fubini’s theorem are more complicated 


than those we made in Theorem 3. They are necessary because if f is not continuous 
everywhere, for example, there is no guarantee that f f(x, y) dy will exist for each x. 


Compute ffi (x? + y) dA, where R is the square [0, 1] x [0, 1]. 


By Fubini’s theorem, 


[ferns | [erna f |f eya dy. 
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example 2 


solution 


example 3 


solution 


By the fundamental theorem of calculus, the x integration may be performed: 
i 


l x3 1 
[etnas [Fry ==+y. 
0 3 3 


x=0 


aie 1 yJ? 5 
2 = = = ria. == ae 
[fo +y = f [5+] w= [y+ zl =F 


What we have done is hold y fixed, integrate with respect to x, and then evaluate the 
result between the given limits for the x variable. Next, we integrated the remaining 
function (of y alone) with respect to y to obtain the final answer. A 


Thus, 


A consequence of Fubini’s theorem is that interchanging the order of integration in the 
iterated integrals does not change the answer. Verify this for Example 1. 


We carry out the integration in the other order: 


1 1 y? 1 
f [enaa | py] ={ |r +5] 0 
o Jo 0 2 iy 2 
- [5+3], = 
—|3 73,76 a 


Wehaveseenthatwhen f(x, y) > OonR = [a, b]x[c, d], theintegral [fp f(x, y) dA 
can be interpreted as a volume. If the function also takes on negative values, then the 
double integral can be thought of as the sum of all volumes lying between the surface 
z = f(x, y) and the plane z = 0, bounded by the planes x = a,x = b, y = c, and 
y = d; here the volumes above z = 0 are counted as positive and those below as 
negative. However, Fubini’s theorem as stated remains valid in the case where f(x, y) 
is negative or changes sign on R ; that is, there is no restriction on the sign of f in the 
hypotheses of the theorem. 


Let R be the rectangle [—2,1] [0,1] and let f be defined by f(x,y) = 
y(x? — 12x); a y) takes on both positive and negative values on R. Evaluate the 
integral ff, f(x, y) dxdy = ff, y(x? — 12x) dx dy. 


By Fubini’s theorem, we can write 


1 1 1 
Ji yx- 12x) dxdy = f p ylx? — 12x) d| y=7 f ydy = 
R o LJ-2 4 Jo 8 


Alternatively, integrating first with respect to y, we find 


II y(x? — 12x) dy dx = ? [fo — 12x)y ay! dx 


S eee Lie? a 3) 
=5 f -1x)éx=5 [7-62] = 3. A 


5.2 The Double Integral Over a Rectangle 281 — 


The Riemann Integral 


The first time most mathematics students encounter the name of Bernhard 
Riemann is in their calculus courses, where they read about the Riemann 
integral. Leibniz had thought of the integral of a function of one variable as an 
infinite sum (the f standing for a sum) of infinitesimal areas f(x) dx, where dx is 
an “infinitesimal width” and f(x) is the height of the corresponding “infinitesimally 
thin” rectangle. This intuitive approach sufficed for most purposes because the 
fundamental theorem 


Historical Note 


b 
f F(x) dx = F(b) — F(a) 


showed how to evaluate this (nebulously defined) integral when one knows the 
antiderivative F of f. 

However, Riemann was interested in applying integration to functions of one 
variable where the antiderivative was not known, and to functions in number 
theory or in general to those functions that “one need not find in nature.” 

Cauchy had already known that all continuous functions could be integrated 
and that the fundamental theorem was valid—that is, every continuous function 
had an antiderivative. However, his proofs were not entirely rigorous. For 
applications to number theory and to certain series (called Fourier series), 
Riemann needed a clear, precise definition of the integral, which he presented in 
a paper in 1854. In this paper he defines his integral and gives necessary and 
sufficient conditions for a bounded function f to be integrable over an interval 
(a, b). 

In 1876, the German mathematician Karl J. Thomae generalized Riemann’s 
integral to apply to functions of several variables, as we do in this chapter. We 
further develop this approach in the Internet supplement. 

In the first half of the nineteenth century, Cauchy had observed that for 
continuous function of two variables, Fubini’s theorem was valid. But Cauchy also 
gave an example of an unbounded function of two variables for which the 
iterated integrals were not equal. In 1878, Thomae gave the first example of a 
bounded function of two variables where one iterated integral exists and the 
other does not. In these examples, the functions were not “Riemann integrable” 
in the sense described in this section. Cauchy and Thomae’s examples 
demonstrated that one must apply caution and not necessarily assume that 
iterated integrals are always equal. 

In 1902, the French mathematician Henri Lebesgue developed a truly 
sweeping generalization of the Riemann integral. Lebesgue’s theory allowed 
integration of vastly more functions than did Riemann’s approach. Perhaps, 
unforeseen by Lebesgue, his theory was to have a profound impact on the 
development of many areas of mathematics in the twentieth century—in 
particular the theory of partial differential equations and probability theory. 
Mathematics students go into more depth about the Lebesgue integral in their 
first year of graduate study. 

In 1907, the Italian mathematician Guido Fubini used the Lebesgue integral to 
state the most general form of the theorem on the equality of iterated integrals, 
the form that is studied today and used by working mathematicians and 
scientists in their research. 
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exercises 


L Evaluate each of the following integrals if 
R = (0, 1] x [0, 1]. 


(a) [fw + y?) dA 

(b) p yer’ dA 
R 

(c) Jf cosan 
R 

(d) a In[(x + 1)(y +1)] dA 
R 


2. Evaluate each of the following integrals if 
R = [0, 1] x [0, 1]. 


(a) J (x™y") dxdy, wherem, n > 0 
R 


(b) J| to by +0) day 
R 

(c) I sin (x + y) dx dy 
R 


(d) J| v2 +2 tym dx dy 
R 


3. Evaluate over the region R: 


yx3 
// ea) 
R 


4. Evaluate over the region R: 


y 3 
J| dxdy, R: [0, 1] x [-2, 2]. 
R 


5. Sketch the solid whose volume is given by: 


1 pl 
f f 6-x-navan 
0 Jo 


6. Sketch the solid whose volume is given by: 


3 2 
| | (9 + x? + y?) dxdy. 
0 J0 


7. Compute the volume of the region over the rectangle 
[0, 1] x [0, 1] and under the graph of z = xy. 


R: [0, 2] x [-1, 1]. 


8. Compute the volume of the solid bounded by the xz 
plane, the yz plane, the xy plane, the planes x = 1 and 
y = 1, and the surface z = x? + y4. 


9. Let f be continuous on [a, b] and g continuous on 


[c, d]. Show that 


b d 
/ (Foxatynideay = | f road] f sna, 
R a È 


where R = [a, b] x [c,d ]. 

Compute the volume of the solid bounded by the surface 
z = sin y, the planes x = 1, x = 0, y = 0, and y = x/2, 
and the xy plane. 

Compute the volume of the solid bounded by the graph 
Z=xX? + y, the rectangle R = [0, 1] x [1, 2], and the 
“vertical sides” of R. 


Let f be continuous on R = [a, b] x [c, d]; for 
a <x <b,c <y <d, define 


x y 
rion ff Pa dwau. 
a E 


Show that a2F /ax ay = Ə?F /ay ax = f(x, y). Use 
this example to discuss the relationship between Fubini’s 
theorem and the equality of mixed partial derivatives. 


Consider the integral in 2(a) as a function of m and n; 


that is, 
f(m,n) := Jf era. 
R 


Evaluate liMm,n—+oo f (m, n). 


mn = f J cos nx sin my dx dy. 


Show that liMm,n—+oo f (m, n) = 0. 


Let: 


Let f: [0, 1] x [0, 1] —> R be defined by 


TE 1 x rational 
a 2y x irrational. 


Show that the iterated integral fo eh f(x, y) dy] dx 
exists but that f is not integrable. 


Express SIr cosh xy dx dy as a convergent sequence, 
where R = [0, 1] x [0, 1]. 
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17. Although F ubini’s theorem holds for most functions met yet 
in practice, we must still exercise some caution. This 1 pl 
DE , ae ay a l ‘any? 
exercise gives a function for which it fails. By using a xe —y dx dy = a: 
substitution involving the tangent function, show that o Jo (X +y?) 4 


i pl y2 
h hw 


— y? 


W hy does this not contradict Theorem 3 or 3’? 


18. Let f becontinuous, f > 0, on the rectangle R. If 


+y’)? 


dydx = 2, 
4 Jf, f dA = 0, prove that f = Oon R. 


5.3 The Double Integral Over More General Regions 


figure 5.3.1 Some y-simple 
regions. 


Our goal in this section is twofold: First, we wish to define the double integral of a 
function f(x, y) over regions D more general than rectangles; second, we want to 
develop a technique for evaluating this type of integral. To accomplish this, we shall 
define three special types of subsets of the xy plane, and then extend the notion of the 
double integral to them. 


Elementary Regions 


Suppose we are given two continuous real-valued functions ¢;: [a,b] —> R 
and ¢2:[a, b] —> R that satisfy ¢,(x) < (x) for all x €e [a,b]. Let D be the set 
of all points (x, y) such that x € [a, b] and #i(x) < y < @(x). This region D is said 
to be y-simple. Figure 5.3.1 shows various examples of y-simple regions. The curves 
and straight-line segments that bound the region together constitute the boundary of D , 
denoted aD . We use the phrase “ y-simple” because the region is described in a relatively 
simple way, using y as a function of x. 

Wesay thata region D is x-simpleif there are continuous functions y and yz defined 
on [c, d] such that D is the set of points (x, y) satisfying 


yelc,d] and wily) <x < wy), 


where wi( y) < y2( y) forall y € [c, d]. Again, the curves that bound the region D con- 
stitute its boundary dD . Some examples of x-simple regions are shown in Figure 5.3.2. 
In this situation, x is the distinguished variable, given as a function of y. Thus, the 
phrase x-simple is appropriate. 

Finally, a simple region is one that is both x- and y-simple; that is, a simple region 
can be described as both an x-simple region and a y-simple region. An example of a 
simple region is a unit disk (see Figure 5.3.3). 


y y 
y= h | x| 
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figure 5.3.2 Some x-simple 
regions. 


figure 5.3.3 The unit disk, a 
simple region: (a) as a y-simple 
region, and (b) as an x-simple 
region. 


yeh? fe 


x= y (y) 
(a) (b) 


Sometimes we will refer to any of the regions as elementary regions. N ote that the 
boundary aD of an elementary region is the type of set of discontinuities of a function 
allowed in Theorem 2. 


The Integral over an Elementary Region 


We can now use an interesting “trick” to extend the definition of the integral from 
rectangles to elementary regions. 


Definition Integral over an Elementary Region If D is an elementary 
region in the plane, choose a rectangle R that contains D. Given f:D — R, 
where f is continuous (and hence bounded), define ffa f(x, y) dA, the integral 
of f over the set D, as follows: Extend f to a function f* defined on all of R by 


Z _ff(txy) if (y)eD 
ran- if (x,y) g D and (x, y) € R. 


Note that f* is bounded (because f is) and continuous except possibly on the 
boundary of D (see Figure 5.3.4). The boundary of D consists of graphs of con- 
tinuous functions, and so f* is integrable over R by Theorem 2, Section 5.2. 
Therefore, we can define 


A f(x, y) aa = ff f*(x, y) dA. 


When f(x, y) > 0 on D, we can interpret the integral ffa f(x, y) dA as the volume 
of the three-dimensional region between the graph of f and D, as is evident from 
Figure 5.3.4. 


figure 5.3.4 (a) Graph of 

z= f(x, y) over an elementary 
region D. (b) Shaded region 
shows graph of z= f*(x, y) on 


some rectangle R containing D. 


From this picture we see that 
boundary points of D may be 
points of discontinuity of f*, 
because the graph of 

Z= f*(x, y) can be broken at 
these points. 


x Elementary region 
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z g 


i; Graph of z: f(x, y) 


ind | 7 
| c | d | c l 
| Ip y | | 
7 l + I 4 ly 
l 2 | Pa l oe ee 
v D V V D L 
y z i 


x 


Graph of ft 


(a) (b) 
We have defined ffo f(x, y)dxdy by choosing a rectangle R that encloses D. It 


should be intuitively clear that the value of ffa f(x, y) dx dy does not depend on the 
particular R we select; we shall demonstrate this fact at the end of this section. 


Reduction to Iterated Integrals 


If R = [a, b] x [c,d] is a rectangle containing D, we can use the results on iterated 
integrals in Section 5.2 to obtain 


n tx y) aa = ff reya f f Friii 
= [ft 9 dkay, 


where f* equals f in D and zero outside D , as before. Assume that D is a y-simple 
region determined by functions ¢ı: [a,b] —> R and ¢2:[a, b] —> R. Consider the 


iterated integral 
b d 
/ n f*(x, y) dy dx 
a E 


and, in particular, the inner integral fi f*(x, y) dy for some fixed x (Figure 5.3.5). By 
definition, f*(x, y) = 0 if y < f(x) or y > (Xx), So we obtain 


d ¢2(x) b2(Xx) 
f Fx yldy= f Fo yay= f F(x, y)dy. 
c g1(X) ġı(x) 


figure 5.3.5 The region between two graphs—a 
y-simple region. 
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example | 


solution 


ga) =0 (> 0) 


figure 5.3.6 A triangle T 
represented as a y-simple 
region. 


example 2 


We summarize what we have obtained in the following. 


Theorem 4 Reduction to Iterated Integrals If D is a y-simple region, 
as shown in Figure 5.3.5, then 


b  ph2(x) 
ff tongs ff 
D a J¢i(x) 


In the case f(x, y) = 1 forall (x, y) e D, ff, f(x, y)dA is the area of D. On the 
other hand, in this case, the right-hand side of formula (1) becomes: 


$2(x) 
E t yaya = | lgb) - 
glx 


which is the formula for the area of D learned in one-variable calculus. Thus, formula 
(1) checks in this case. 


f(x, y) dy dx. (1) 


x)]dx = A(D), 


Find ff, (x?y + cosx) dA, where T is the triangle consisting of all points (x, y) such 
that0 <x <7/2,0<y <x. 


, we have 


m/2 
1i x?y + cosx) dA = f fi x?y + cosx) dy dx 
m/2 FT y3y,2 x m/2 
a [E +y cosx] ax= | (5 +xcosx) dx 
0 2 y=0 0 2 


Referring to Figure 5.3.6 and formula (1) 


x61 7/2 x/2 i xô z/2 
= al +i (xcosx) dx = ———— (12)(64) + [x sinx + cos x]p 
of PEUN 

768 2 A 


In the next example, we use formula (1) to find the volume of a solid whose base is 
a nonrectangular region D. 


Find the volume of the tetrahedron bounded by the planes y=0, z = 0, x = 0, and 
y —X +z = 1 (Figure 5.3.7). 


Z 
A 


(0, 1, 0) 


y-xtz=1 


figure 5.3.7 A tetrahedron bounded by the planes 
y=0,z=0,x=0,and y- x+z=1. 


/ (-1, 0, 0) 


solution 


example 3 


solution 
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We first note that the given tetrahedron has a triangular base D whose points (x, y) 
satisfy —1 < x < Oand 0 < y < 1 +x; hence, D isa y-simple region. In fact, D isa 
simple region; see Figure 5.3.8. 


y 


figure 5.3.8 The base of the tetrahedron in 
Figure 5.3.7 represented as a y-simple region. 


h x)=xt 1 


C 1,0) G(x) = 0 (0, 0) 


For any point (x, y) in D, the height of the surface z above (x, y) is 1 — y +x. Thus, 
the volume we seek is given by the integral 


[fa-y+oa 


Using formula (1) with @(x) = 0 and (x) = x +1, we have 


14x y? 14+x 
1 a-y+nda= f f a-y+dya= f ja ay- dx 
= y=0 


+ 
1+x)? dk (1+x}]" 1 
-/ | 2 | =| 6 | = 6 a 


Let D bea y-simple region. Describe its area A(D ) as a limit of Riemann sums. 


If we recall the definition, A(D) = ffo dxdy is the integral over a containing rect- 
angle R of the function f = 1. A Riemann sum S, for k D is obtained by 
dividing R into subrectangles and forming the sum Sn = *(Gk) Ax Ay, as 
in formula (1) of Section 5.2. Now f*(q x) is 1 or 0, RATA on whether or not 
Gk isin D. Consider those subrectangles Rj, that have nonvoid intersection with D, 

and choose Gk in D N Rix. Thus, S, is the sum of the areas of the subrectangles that 
meet D and A(D) is the limit of these as n — oo. Thus, A(D) is the limit of the 
areas of the rectangles “circumscribing” D . You should draw a figure to accompany this 
discussion. A 


The methods for treating x-simple regions are entirely analogous. Specifically, we 
have the following. 


Theorem 4 Iterated Integrals for x-Simple Regions Suppose that D is 
the set of points (x, y) such that y €e [c,d] and wi(y) < x < wy). If f is 
continuous on D, then 


d yaly) 
Tf f(x, y) dA = J Fx, y) dx dy. (2) 
D C wily) 
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To find the area of D, we substitute f = 1 in formula (2); this yields 


I dA = foy — ya (y)) dy. 


Again, this result for area agrees with the results of single-variable calculus for the area 
of a region between two curves. 

Either the method for y-simple or the method for x-simple regions can be used for 
integrals over simple regions. 

It follows from formulas (1) and (2) that ffa f dA is independent of the choice of the 
rectangle R enclosing D used in the definition of ffa f dA, because, if we had picked 
another rectangle enclosing D, we would have arrived at the same formula (1). 


exercises 


L In parts (a) through (d) below, each iterated integral is an 2. Sketch the region D in R? that represents the region of 
integral over a region D . Match the integral with the integration: 


correct region of integration. 


2 p4-y? 
(a) If (4 — x) dxdy 
—2 J0 
2 pæ 2 r0 3 px 
a f f ayax w dx dy o f f t+ — 2x) dyd 
1 Jinx : 0 J—./9-y2 H 0 A $ j = 


2 pe 3 p0 3. Evaluate the following iterated integrals and draw the 
(b) dy dx (d) J f dx dy regions D determined by the limits. State whether the 
0 J(1/8)x 0 Jarccosy/3 


regions are x-simple, y-simple, or simple. 


1 px? l1 pæ 
y y (a) i. dy dx (c) 1 (x + y) dy dx 
} } o Jo 0 Ji 
2 p3x+1 1 px? 
7 (b) I dy dx (d) I / y dy dx 
a 1 J2x 0 Jx 


-3 EEA 4. Evaluate the following integrals and sketch the 
| corresponding regions. 


a a 
| (a) / | (x? + y) dx dy 
-3 J0 
(i) (ii) 1 pxl 
(b) IJ e*+Ydy dx 
—1 J —2ļx| 
1 (1—x2)1/2 
(c) If dy dx 
0 Jo 


T T m/2 pcosx 
ar 27 a f f y sin xdy dx 
24 1+ 0 0 


m H mm 1 py 
i 2 Ld? (e) [I (x"+y™)dxdy, m,n>0 
0 Jy? 


1/2 


7 ij 0 p2(1=x?) 
(f) J | xdy dx 
(iti) (iv) -1J0 


Hex 


n|” 


> 
> 


Use double integrals to compute the area of a circle of 
radius r. 


Using double integrals, determine the area of an ellipse 
with semiaxes of length a and b. 


What is the volume of a barn that has a rectangular base 
20 ft by 40 ft, vertical walls 30 ft high at the front (which 
we assume is on the 20-ft side of the barn), and 40 ft 
high at the rear? The barn has a flat roof. Use double 
integrals to compute the volume. 


Let D be the region bounded by the positive x and 
y axes and the line 3x + 4y = 10. Compute 


1 (x? + y?) dA. 
D 


Let D be the region bounded by the y axis and the 
parabola x = —4y? + 3. Compute 


I x3y dx dy. 
D 


x2 
Evaluate (x? + xy — y?) dy dx. Describe this 


o Jo 
iterated integral as an integral over a certain region D in 
the xy plane. 


Let D be the region given as the set of (x, y), where 
1 <x? + y? < 2and y > 0.IsD an elementary region? 
Evaluate Th f(x, y) dA, where f(x, y) =1+4xy. 


Evaluate the following double integral: 


I cos y dx dy, 
D 


where the region D is bounded by y = 2x, y = x, 
X =m,andx = 27. 


13. Evaluate the following double integral: 


I xy dA, 
D 


5.4 Changing the Order of Integration 


17. 
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where the region D is the triangular region whose 
vertices are (0, 0), (0, 2), (2, 0). 


Use the formula A(D) = ff, dx dy to find the area 
enclosed by one period of the sine function sin x, for 
0 <x < 2x, andthe x axis. 


Find the volume of the region inside the surface 
z = x? + y? and between z = 0 and z = 10. 


Set up the integral required to calculate the volume of a 
cone of base radius r and height h. 


Evaluate ffy y dA, where D is the set of points (x, y) 
such that 0 < 2x/x < y, y < sinx. 


From Exercise 9, Section 5.2, 


b d b 
IJ Foxy) aydx = ( f fo ax) 
a C a 


d 
(/ say). Is it true that ff, f(x)g(y) dx dy = 
€ 


b a(b) 
(/ f(x) ix) q g(y) a) for y-simple 
a gla) 


regions? 


Let D bea region given as the set of (x, y) with 
—(X) < y < #(x) anda <x <b, where ¢ isa 
nonnegative continuous function on the interval [a, b]. 
Let f(x, y) bea function on D such that 

f(x, y) = — f (x, —y) for all (x, y) < D. Argue that 
Sfo f(x. y) dA = 0. 


Use the methods of this section to show that the area of 
the parallelogram D determined by two planar vectors a 
and bis |a1b2 — a2b1|, where a= aÏ + apj and 

b= bii + boj. 


Describe the area A(D) of a region as a limit of areas of 
inscribed rectangles, as in Example 3. 


Suppose that D is a simple region—that is, it is both x-simple and y-simple. Thus, it 
can be given as the set of points (x, y) such that 


a<x <b, 


d(x) < Y < g2(X), 


and also as the set of points (x, y) such that 


c<yx<d, 


Wily) < X < Wy). 
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example | 


solution 


Hence, we have the formulas 


yal y) 
ff tsoa- T tanya f SA f(x, y) dxdy. 
g1(x) yıl 


If we are required to compute one of the preceding iterated integrals, we may do so 
by evaluating the other iterated integral; this technique is called changing the order of 
integration. It can be useful to make such a change when evaluating iterated integrals, 
because one of the iterated integrals may be more difficult to compute than the other. 


By changing the order of integration, evaluate 


(a2—x2)1/2 


f f y2)1/2dy dx. 


Note that x varies between 0 and a, and for each such fixed x, we have 0 < y < 
(a? — x?)!/2, Thus, the iterated integral is equivalent to the double integral 


j e= y2)1/2dy dx, 


where D is the set of points (x, y) such that 0 < x < a and 0 < y < (a? — x?) 1/2. But 
this is the representation of one-quarter (the positive quadrant portion) of the disk of 
radius a; hence, D can also be described as the set of points (x, y) satisfying 


0<y<a, 0<x<(a?-— y?) 


a (a?—y?)1/2 
= yayax = f p (a? — yay dy 
0 0 


a 2_ 21/2 
= f ke- ye ay 


(see Figure 5.4.1). Thus, 


13 


a2—x2)1/2 


figure 5.4.1 The positive-quadrant portion of a 
disk of radius a. 


(a, 0) A 


We could have evaluated the initial iterated integral directly, but, as you can easily 
verify, changing the order of integration makes the problem simpler. The next example 
shows that it may not be obvious how to evaluate an iterated integral, and yet it may 
be relatively simple to evaluate the iterated integral obtained by changing the order of 
integration. 


example 2 


solution 


figure 5.4.2 Dis the region of 
integration for Example 2. 
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Evaluate 
2 plogx 
ie (x —1) V1 +e dy dx. 
pl 0 


It will simplify matters if we first interchange the order of integration. First, notice that 
the integral is equal to Jo (x — 1)v1 +e% dA, where D is the set of (x, y) such that 


1<x <2 and 0< y <logx. 
The region D is simple (see Figure 5.4.2) and can also be described by 
0< y <log2 and e <x <2. 


Thus, the given iterated integral is equal to 


log 2 
J [i-o y [ vire] [ (x — 1) | gy 
ey 


log 2 
/1 + ey Ë -x] oy 
log2 e2y 
| ee 
0 
a 
= -5 f yT Ey [ ey 1+ edy. (1) 


In the first integral in expression (1), we substitute u = e?”, and in the second, v = e’. 
Hence, we obtain 


1 4 2 
|| viFudu+ f V1l+v2dv. (2) 
1 L 
Both integrals in expression (2) are easily found with techniques of one-variable calculus 


(or by consulting the table of integrals at the back of the book). For the first integral, 
we get 


4 4 1 
ra Al +udu = K + u) = a a4)? 2 23/2] 2 p” _ 23/2], (3) 
1 1 
The second integral is 
2 2 
f vl+vdv= ; [evVi+ 0? + log (VI+ v+ v) |, 
1 


= 5 [2v5 + tog (v5 +2) - $ [v2 + log (v2 + 0) (4) 


(see formula 43 in the table of integrals at the back of the book). Finally, we subtract 
equation (3) from equation (4) to obtain the answer 


1 
; (2v5- VZ+ og G+) = (5°? 27), 


A 
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example 3 


Mean-Value Inequality 


We conclude with an inequality that helps us estimate integrals. Suppose there are 
numbers m and M such that for all (x, y) €< D, andm < f(x, y) < M, then integrating 
over D, we get 


m-A(D) < ff F(x, y)dA <M -A(D), (5) 
D 


where A(D) is the area of the region D. Even though this inequality is obvious, it can 
help us estimate integrals that we cannot easily evaluate exactly. 


Consider the integral 


1 
——__—_ dx dy, 
JI Y1+x®+ y8 ' 
where D is the unit square [0, 1] x [0, 1]. Because the integrand satisfies, for x and y 
between 0 and 1, 
1 1 
< < 
v3 ~ y1+x5+ y8 


and because the square has area 1, we get: 


1 1 
—< —— ~  dxdy<1. 
a VJ1+ x4 y8 as A 


Mean-Value Equality 


The mean-value inequality can be turned into an equality when f is continuous. Here 
is the formal statement. 


Theorem 5 Mean-Value Theorem: Double Integrals 
Suppose f : D — Ris continuous and D is an elementary region. Then for some 
point (Xo, Yo) in D we have 


I F(x, y) dA = f (xo, Yo)A(D), 


where A(D ) denotes the area of D. 


proof We cannot prove this theorem with complete rigor, because it requires some 
concepts about continuous functions not proved in this course; but we can sketch the 
main ideas that underlie the proof. 

Because f is continuous on D, it has a maximum value M and a minimum value m. 
Thus, m < f(x,y) < M for all (x,y) € D. Furthermore, f(Xı, yı) = m and 
f (X2, Y2) = M for some pairs (Xi, yı) and (x2, y2) in D. 

Dividing through inequality (5) by A(D ), we get 


1 
nsz), f(x, y)dA <M. (6 


~ 
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Because a continuous function on D takes on every value between its maximum and 
minimum values (thisis the two-variableintermediate-value theorem A in a 
calculus; seealso R eview Exercise32), and becausethenumber[1/A(D )] ffa f(x, y) dA 
is, by inequality (6), between these values, there must be a point (Xo, Yo) € D with 


f (Xo, w= aay ff F(x, y) dA 


which is precisely the conclusion of Theorem 5. m 


exercises 


L Change the order of integration, but do not evaluate, the 


following integrals: 


8 4 /16—y2 ar 
(a) f J dx dy (c) [ jaa 
0 J1/2y 


16— ve 


9 vY m sin x 
(b) If dx dy a f f dy dx 
0 J0 n/2 J0 


2. Change the order of integration and evaluate: 


1 pl 
| il sin(x2) dx dy. 
0 Jy 


3. In the following integrals, change the order of 
integration, sketch the corresponding regions, and 
evaluate the integral both ways. 


1 pl 
(a) f f xy dy dx 
0 X 
m/2 cose 
o f 1 cosé dr dé 
0 0 
1 p2-y 
(c) Ee (x + y)?dx dy 
0 J1 


b py 

(d) F., f (x, y) dx dy (express your answer in 
a a 
terms of antiderivatives). 


Find 


1 pl 
@ f (x + y)2dx dy 
-1 Jy] 
1 pr/(9-y?*) 
(b) / / x?dx dy 
-3 J—./(9-y7) 
4 p2 
(c) If eX’ dx dy 
0 y/2 
1 px/4 
(d) jal (sec? x) dx dy 
0 Jtan-ly 


5. Change the order of integration and evaluate: 


a 
f J e* dxdy. 
0 SV 


6. Consider the intuitive fact that if a region D in R? can 
be split into a disjoint union of subsets D = D1 U D2, 
then a double integral over D may also be divided into a 
sum of two integrals: 


Jf toonea= ff rox yaa || f(x, y) dA. 
D Di D2 


(See Section 5.2 for the analogous statement over a 
rectangular box.) A re the following attempts to change 
the order of integration true or false? 


(a) 


m/4 pCOSXx J2/2 parcsiny 2 arccos y 
f | dy dx a | oxdy+ f | dx dy 
sinx 0 0 V2/2/0 
(b) 2 phx? 4 p/i 
J I dy dx =o / dx dy 
—2 J0 0 J—/a-y 


(c) 
2 p(1/2)x 5 pl 1 p3y+2 
If yas f f dyox= f f dxdy 
0 Jo 2 J(1/3)x—(2/3) 0 J2y 
(d) 1 pæ e pl 
l f ya= | | dx dy 
0 J1 1 Jiny 


7. If f(x, y) = e+) and D = [—x, x] x [—2, 7], 


show that 
i 
ee a J f(x, y) dA < e. 


8. Show that 


sin x 
z= cos) < Jf dx dy < 1. 
[0,1}xt0,1) 1+ (xy)4 
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9. If D =[-1, 1] x [—1, 2], show that 


< dx dy <6. 
~ J Jp 2 +y? +17 


10. Using the mean-value inequality, show that 


1 dA 1 
=< -a = Sj, 
6~ JJp Y-X+37 4 


where D is the triangle with vertices (0, 0), (1, 1), and 
(1, 0). 


IL Compute the volume of an ellipsoid with semiaxes a, b, 
and c. (HINT: Use symmetry and first find the volume of 
one half of the ellipsoid.) 

12. Compute f(x, y) dA, where f(x, y) = y2,/x and 


D 
D is the set of (x, y), where x > 0, y > x2, and 
y 210 — 7, 


13. Find the volume of the region determined by 
x2 + y? +z? < 10, z > 2. Use the disk method from 
one-variable calculus and state how the method is 
related to Cavalieri’s principle. 


14. Evaluate e*—Ydx dy, where D is the interior of the 


D 
triangle with vertices (0, 0), (1, 3), and (2, 2). 


15. Evaluate yh y3(x? + y2)~3/2dx dy, where D is the 


region yetermined by the conditions } 5 <y<land 
x+y? <L 


5.5 The Triple Integral 


16. Given that the double integral I f (x, y) dxdy of a 


D 
positive continuous function f equals the iterated 


1 X 

integral f h Foxy) dx, sketch the region D 
0 x2 

and interchange the order of integration. 


17. Given that the double integral I f (x, y) dx dy of a 
D 
positive continuous function f equals the iterated 


1 2-y2 
integral p a f(x, y) dx} dy, sketch the 
0 y 


region D and interchange the order of integration. 


18. Prove that ; 


b pb b 
2 f I todtonaya= ( f fo ax) ; 
a X a 


[HINT: Notice that 


([ ree) =f f (x) f(y) dxdy.] 
[a,b]x[a,b] 


19. Show that (see Section 2.5, Exercise 29) 


a in yidedy = f f(x, y, z) dz 


ff fx (xX, y, z) dzdy. 


Triple integrals are needed for many physical problems. For example, if the temperature 
inside an ovenisnot uniform, determining the average temperature involves “summing” 
the values of the temperature function at all points in the solid region enclosed by the 
oven walls and then dividing the answer by the total volume of the oven. Such a sum is 
expressed mathematically as a triple integral. 


Definition of the Triple Integral 


Our objective now is to define the triple integral of a function f(x, y, z) over a box 


(rectangular parallelepiped) B 


= [a,b] x [c,d] x [p,q]. Proceeding as in double 


integrals, we partition the three sides of B into n equal parts and form the sum 


n—1 n—-1 n-1 


f ( (Gik) A 
i=0 j=0 k=0 


where Gj, is a pointin Bij, the i j kth rectangular parallelepiped (or box) in the partition 
of B, and AV is the volume of Bj (see Figure 5.5.1). 
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figure 5.5.1 A partition of a box B 
into n° subboxes Bix. 


Definition Triple Integrals Let f be a bounded function of three variables 
defined on B. If limit,... Sn = S exists and is independent of any choice of Gx, 
we call f integrable and call S the triple integral (or simply the integral) of f 
over B and denote it by 


JII f dV, JII f(x, y,z) dV or III f(x, y, z) dx dy dz. 


Properties of Triple Integrals 


As before, we can prove that continuous functions defined on B areintegrable. M oreover, 
bounded functions whose discontinuities are confined to graphs of continuous functions 
[such as x = a(y,Z), y = A(X,Z), or z = y(x, y)] are integrable. The other basic 
properties (such as the fact that the integral of a sum is the sum of the integrals) for 
double integrals also hold for triple integrals. Especially important is the reduction to 
iterated integrals: 


Reduction to Iterated Integrals Let f(x, y, z) be integrable on the box 
B =[a,b] x [c,d] x [p,q]. Then any iterated integral that exists is equal to the 
triple integral; that is, 


q d b 
tx yzidxaya= f° f f f (x, y, z) dx dy dz 
B p Jc a 
q b pd 
zn f(x, y, z) dydxdz 
D AA Ane 
b rq d 
=] f(x, y, z) dydzdx, 
a p Jc 


and so on. (There are six possible orders altogether.) 
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example | | (a) LetB bethe box [0, 1] x [—4, 0] x [0, 4]. Evaluate 


IK + 2y + 3z)2dx dy dz. 
B 


(b) Verify that we get the same answer if the integration is done in the order y first, 
then z, and then x. 


solution (a) According to the principle of reduction to iterated integrals, this integral may be 
evaluated as 


1/3 0 1 
f J f (x + 2y + 3z)?dx dy dz 
0 -1/2 Jo 
1/3 p0 1 
= f J Jove 
0 =1/2 =0 


1/3 1 
ay 1 = |(1 + 2y +32)? — (2y + 3z)°]dy az 
0 -1/2 3 


(x + 2y + 3z)? 


-[" a [(a + 2y + 32)4 — (2y +32) a dz 


y=-1/2 


ya 4 4 
-f A [(32 +0 — 2(32) + (3z — 1) |oz 


[(3z + 0° -282° + 3z- 1)5| ji 


z=0 


o 1I 
~ 24-15 


b 1 
=z 52° -2) = 


vik x + 2y + 3z)*dy dz dx 
1/3 

-[f f (x + 2y + 3z)?dy dz dx 
0 1/2 
Iogi 

=f [ y=-1/2 
1 1/34 

-[f al [(x +32)? — (x + 3z — 1)°]azdx 
0 


_ 1/3 
(x +3z)4 (x+23z aa Ja 
z=0 


12 12 
72 
if 5]! 1 
= i| 4(%—1)° 28) =a ra 


(x + 2y + 3z)? 
6 


fee 


= Phe = ih — 2x4] dx 


example 2 


solution 


example 3 


solution 


figure 5.5.2 Two elementary 
regions in space. The domain D 
in the figure on the left is 
y-simple, while on the right it is 
x-simple. 
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Integrate e*+Y+z over the box [0, 1] x [0, 1] x [0, 1]. 


We perform the integrations in the standard order: 


1 pl pl l1 pl 
f f I ex +ty+z dx dydz = | f (er 2) dy dz 
0 J0 J0 0 J0 
1 pl L 1 
= I | (elty+2 _ ey +2) dydz = f [e = e”+z]| dz 
0 Jo 0 y=0 
1 


1 
-| jer — elt +e| dz = [e — 2e! +e| 
0 0 
=e? — 3e + 3e — 1 = (e — 1)’. A 


Asin the two-variable case, we define the integral of a function f over a bounded region 
W by defining anew function f *, equal to f on W and zero outside W , and then setting 


If F(x, yz dxdydz= || F(x, y, z) dxdydz, 


where B is any box containing the region W. 


Elementary Regions 


A s before, we restrict our attention to particularly simple regions. An elementary region 
in three-dimensional space is one defined by restricting one of the variables to be 
between two functions of the remaining variables, the domains of these functions being 
an elementary (i.e., an x-simple or a y-simple) region in the plane. For example, if D 
is an elementary region in the xy plane and if yı(x, y) and y2(x, y) are two functions 
with y2(x, y) > w(x, y), an elementary region consists of all (x, y, z) such that (x, y) 
liesin D and y(x, y) < Z < y2(x, y). Figure 5.5.2 shows two elementary regions. 


Describe the unit ball x? + y? + z? < 1 as an elementary region. 


This can be done in several ways. One, in which D is y-simple, is: 


Z= HO y) z=, y) 

La c F 
z] oo , 
= y) ee | D DE nw y 

= x= %)(x) 7 
b p -~ = y 
m | 4 
5 y =05(x) 2= 74% 9) 
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example 4 


Z= Hx, y= 1 -x -y? 
y=6,(x) = V1-x?7 


x o | 2=4@y)=- Vix 
figure 5.5.3 The unit ball as an elementary region in space. 


In doing this, we first write the top and bottom hemispheres as z = ,/1 — x? — y? 
and z = —,/1— x? — y?, respectively, where x and y vary over the unit disk (that is, 
—J/1— x? < y < /1—x? and x varies between —1 and 1). (See Figure 5.5.3.) We 
can describe the region in other ways by interchanging the roles of x, y, and z in the 
defining inequalities. A 


Integrals over Elementary Regions 


As with integrals in the plane, any function of three variables that is continuous over an 
elementary region is integrable on that region. An argument like that for double integrals 
shows that a triple integral over an elementary region can be rewritten as an iterated 
integral in which the limits of integration are functions. The formulas for such iterated 
integrals are given in the following box. 


Triple Integrals by Iterated Integration Suppose that W is an elemen- 
tary region described by bounding z between two functions of x and y. Then 
either 


21x) py2(X,y) 
II f(x,y, 2oxayde = f i i f(x, y, Z) dz dy dx 
ġı(x) (x,y) 


[see Figure 5.5.2 (left)] or 


yaly) 
If f(x, y, 2) axoy d= f i re f(x, y, Z) dz dx dy 
yıl yı(x,y) 


[see Figure 5.5.2 (right)]. 


If f = 1, we get the integral [ffy dx dy dz, which is the volume of the region W . 


Verify that the volume formula for the ball of radius 1: 


4 
dxdydz = =z, 
JII pm ae 


where W is the set of (x, y, z) with x? + y? +z? <1. 
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solution We use the description of the unit ball from Example 3. From the first formula in the 
preceding box, the integral is 


j= fly? 
[ — i nl dz dy dx. 
Vie J1-x2-y2 


Holding y and x fixed and integrating with respect to z yields 


[cl eon [Eee] 


Because x is fixed in the y- integral, it can be expressed as fèl — y?)*/? dy, where 
a =(1—x?)}/”, This integral is the area of a semicircular region = radius a, so that 


a 
i (a? _ y2)¥/2 dy = y 
—a 


(We could also have used a trigonometric substitution or a table of integrals.) Thus, 
1—x2 2 
1-x 
l (1—x? - y?) dy = =r, 
_ Stax? 2 


and so 


1 a/ 1—x?2 1 
2 | i. (1= 3? = Paya =2 f m. 
-1J—./1-x2 -1 
1 VENET! 
=x f (1- x) dx = 2 (x- 7) 
“1 3 


X=-1 


Other types of elementary regions are shown in Figure 5.5.4. For instance, in the second 
region, ( y, z) liesin an elementary region in the yz plane and x lies between two graphs: 


pily,Z) < X < poly, 2). 


N 
N 


figure 5.5.4 Types of elementary y y y 
regions in space. x x 


4 $ 


Top and bottom are Front and rear are Left and side are 
surfaces z= y(x, y) surfaces x = p(y, z) surfaces y = 6(x, z) 
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figure 5.5.5 Asymmetric 
elementary region can be 
described in three overall ways. x 


As shown in Figure 5.5.5, some elementary regions can be simultaneously described in 
all three ways. We shall call these regions symmetric elementary regions. 

Corresponding to each description of a region as an elementary region is an integra- 
tion formula. For instance, if W is expressed as the set of all (x, y, z) such that 


c<y<d, wWily)<z<wly),  oly,z) <x <poly,2), 


d pyly) peal y.z) 
i F(x, y,2) dxdy dz = | | J f(x, y, z) dxdzdy. 
w c Jyly) Ypily,z) 


example 5 | Let W bethe region bounded by the planes x = 0, y = 0, and z = 2, and the surface 
z = x? + y? and lying in the quadrant x > 0, y > 0. Compute JII x dx dy dz and 
sketch the region. s 


then 


solution Method 1. The region W is sketched in Figure 5.5.6. As indicated in the figure, we can 
describe this region by the inequalities 


0<x<4/2, 0<y<vV2-xX, xX +y <2z<2. 


figure 5.5.6 Wis the region below the plane z = 2, 
above the paraboloid z = x? + y?, and on the 
positive sides of the planes x = 0, y = 0. 
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Therefore, 


rann- ("LL (E00) 9] 


oe 
-[" a x(2 — x? — y?) dy dx 
= [x[ie-x pp jox 
0 3 


J2 = _ 25/2 v2 
_ | 2X (9 ade =2(2 = xP" 
o 3 15 


5 25⁄2 8/2 


B B 


2)1/2 


Method 2. We can also place limits on x first and describe W by 0 < x < (z — y 
and ( y, z) in D, where D is the subset of the yz plane with 0 < z < 2 and 0 < y < z! 
(see Figure 5.5.7). 


figure 5.5.7 A different description of the region in 
ye Jz Example 5. 


Therefore, 


fpr- ff( +8) 9 
viata: 
“Ef (20s 
=f Fe =f ee 


7 = [e] _ 2 a5 _ 872 
0 


15 15° 15" 


which agrees with our previous answer. A 
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example 6 | Evaluate 


1 X 2 
| f / dz dy dx. 
0 JO Jx2+y2 


Sketch the region W of integration and interpret. 


solution l px p2 l1 px 
SJ] dedydx = | f (2— x2 y?) dydx 
0 JO Jx?+y? 0 J0 
1 


3 
z 3X \)qez-i_1_2 
= (2x X 5) éx=1 Inr: 


This integral is the volume of the region sketched in Figure 5.5.8. 


figure 5.5.8 The region Wlies between the 
paraboloid z= x? + ¥ and the plane z = 2, 
and above the region D. 


exercises 


L In parts (a) through (d) below, each iterated integral is an 
integral over a region D . Match the integral with the 
correct region of integration. 


2 3 pI 1 px py 
(a) f L dydzdx (c) | | f dzdy dx 
o Jo J-Va=x 0 . A 
2 3 vI 
(b) | 1 i dy dx dz (d) | | | dzdx dy 
o Jo J—yo=x? 0: 10:e 0 
z y 


4 Gii) (iv) 


2. Evaluate the following triple integral: 


If sin x dx dy dz, 
w 


where W is the solid given by 0 <x <x,0<y<1, 
G) (ii) and0 <z <x. 
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In Exercises 3 to 6, perform the indicated integration over the given box. 


3 coc = [0, 1] x [0, 1] x [0, 1] 
B 

4 JII e-*Y ydxdydz, B = [0, 1] x [0, 1] x [0, 1] 
B 


5. J [ [ 3ye, B = [0, 2]x[—1, 1]x[0, 1] 
B 


6. JII ze*t¥dxdydz, B = [0, 1] x [0, 1] x [0, 1] 
B 


In Exercises 7 to 10, describe the given region as an elementary region. 


7. The region between the cone z = ,/x2 + y? and the 
paraboloid z = x2 + y? 


8. The region cut out of the ball x2 + y? + z2 < 4 by the 


elliptic cylinder 2x2 + z? = 1; that is, the region inside 
the cylinder and the ball 


Find the volume of the region in Exercises 11 to 14. 


IL The region bounded by z = x2 + y? and 
z = 10 — x? — 2y? 


12. The solid bounded by x2 + 2y? = 2, z = 0, and 
x+y+2z=2 


Evaluate the integrals in Exercises 15 to 23. 


1 p2 p3 
15. | I f cos[z(x + y + z)] dx dy dz 
o Ji J2 
1 px y 
1 | | f y+ xa)ceayen 
o Jo Jo 


17. JII (x? + y? + z?) dx dy dz; W is the region bounded 
w 
by x + y +z = a (wherea > 0), x = 0, y =O, and 
z=0. 


18. JII z dx dy dz; W is the region bounded by the 
w 


planes x = 0, y = 0, z = 0, z = 1, and the cylinder 
x? +y? =1,withx >0,y > 0. 


19. I x? cos zdx dy dz; W is the region bounded by 
w 
z=0,z=7,y=0,y=1,x =0,andx+y=1. 


9. The region inside the sphere x? + y? + z? = 1 and 
above the plane z = 0 


10. The region bounded by the planes x = 0, y = 0, 
z=0,x+y=4,andx =z-—y-1 


13. The solid bounded by x = y,z=0, y 
x+ty+z=0 


= 0, x =1, and 


14. The region common to the intersecting cylinders 
x? +y? <a? and x? +z? < a? 


2 pX pxt+y 
20. J f | dz dy dx 
0 JO J0 


2L (1 — z2) dx dy dz; W is the pyramid with top 


vertax at (0, 0, 1) and base vertices at (0, 0, 0), (1, 0, 0), 
(0, 1, 0), and (1, 1, 0). 


22. II (x? + y?) dx dy dz; W is the same pyramid as in 
w 


Exercise 21. 


1 p2x px+y 
z | | f dz dy dx. 
0 J0 x24y2 
24. (a) Sketch the region for the integral 


1 px y 
| | | f(x, y, z) dz dy dx. 
0 Jo Jo 


(b) Write the integral with the integration order 
dx dy dz. 


For the regions in Exercises 25 to 28, find the appropriate limits 41(x), 62(x), yı(x, y), and y2(x, y), and write the triple 


integral over the region W as an iterated integral in the form 


i ue 


y2(XY) 
p tyi) ee a} dx. 
yı(x,y) 
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25. W = {(x, y,z) | Vx? +y? <2 <1) 30. Let W be the region bounded by the planes 
x=0,y=0,z=0,x+y=1,andz=x+y. 
26. W = (x,y,z) | 3 £Z < land x? +y? +z? < 1) (a) Find the volume of W. 
27. W = {(x, y, z) |x? +y? < 1,z = O and (b) Evalute fff xdxdy dz. 
X? +y? +z? <4} (c) Evalute [ffy ydxdy dz. 
28. W = {(x, y,z) | Ix] < 1, |y] < 1,z > 0 and 3L Let f be continuous and let B, be the ball of radius « 
x? +y? 47 <1} centered at the point (Xo, Yo, Zo). Let vol (B+) be the 


volume of B+. Prove that 
29. Show that the formula using triple integrals for the 


volume under the graph of a positive function f(x, y), lim anes fff f(x, y,z) dV = f (Xo, Yo, Zo). 
on an elementary region D in the plane, reduces to the e>0 vol (Be) B, 
double integral of f over D. 


review exercises for chapter 5 


Evaluate the integrals in Exercises 1 to 4. 


3 x?+1 1 e2x 
Lj) xy dy dx af x In y dy dx 
0 J—x2?+1 0 Je 
Lop 1 p2 73 
aff (x + y)2dy dx aff | cos[x(x + y +z)]dx dy dz. 
0 J yx 0/1 J2 


Reverse the order of integration of the integrals in Exercises 5 to 8 and evaluate. 


5. The integral in Exercise 1 14. Change the order of integration and evaluate 


igi 
6. The integral in Exercise 2 | f (x? + yx) dx dy. 
0 y1/⁄2 


Ze TRG NEG ENIE XeECIe 15. Let D be the region in the xy plane inside the unit circle 


x? + y? = 1. Evaluate ff, f(x, y) dxdy in each of the 


8. The integral in Exercise 4 following cases: 


9. Evaluate the integral [> fă fo’ y + xz) dz dy dx. (a) (a)] f(x, y) = xy 
T (b) f(x, y) =x?y? 
10. Evaluate f, Í eX/Y dx dy. (c) f(x, y) =x3y3 
1L Evaluate i JEYA y cosxy dxdy. 16. Find ff, y[1 — cos (xx/4)ldx dy, where D is the region 
in Figure 5.R.1. 


12. Change the order of integration and evaluate 


ae 
i (x + y)2dx dy. 
0 Jy/2 


x 


13. Show that evaluating ff dxdy, where D is a y-simple 
region, reproduces the formula from one-variable figure 5.R.1 The region of integration for 
calculus for the area between two curves. Exercise: 16. 
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Evaluate the integrals in Exercises 17 to 24. Sketch and identify the type of the region (corresponding to the way the integral is 


written). 


mz p3Sinx 
17. | x(1 + y) dy dx 
0 si 


in x 


1 fx cos(2x/2) 

8 | f (x? + xy + 1) dydx 
0 Jx-1 
1 p(2-y)? 3 

19. | J (5-7-2) dx dy 
<7 y2/3 2 


2 p3(Va=x2) /2 5 
20. | i ( — + s) dy dx 
o J-a x22 \ V2 +X 


1 px? 
a | f (x? + xy — y?) dy dx 
0o Jo 
4 py? 
2 | f 3dx dy 
2 Jy-1 
1 X 
23. f | (x + y)?dy dx 
0 Jx? 
1 3y 
a | f e*+Ydx dy 
o Jo 


In Exercises 25 to 27, integrate the given function f over the given region D. 


25. f(x, y) =x — y; D is the triangle with vertices (0, 0), 


(1, 0), and (2, 1). 


26. f(x, y) =x3y + cosx; D is the triangle defined by 
0<x <7/2,0<y <x. 


27. f(x, y) =x? + 2xy2 + 2; D is the region bounded by 
the graph of y = —x? + x, the x axis, and the lines 
X = Oand x =2. 


In Exercises 28 and 29, sketch the region of integration, interchange the order, and evaluate. 


A pk 
28. f | (x? + y?) dy dx 
1/1 


1 1 
29, f / (x + y?) dx dy 
0 J1-y 


30. Show that 


4e° < I eX +Y dA < de25, 
[1,3]x[2,4] 


3L Show that 


Ax < f f 0è +y? + Waray = 20%, 
D 


where D is the disk of radius 2 centered at the origin. 


Evaluate the integrals in Exercises 34 to 36. 


1 pz y 
3A. f f f xy?z3dx dy dz 
o Jo Jo 
1 py px/v3 x 
35. I e zg gi tz ox dy 


32. Suppose W is a path-connected region; that is, given 
any two points of W there is a continuous path joining 
them. If f is acontinuous function on W, use the 
intermediate-value theorem to show that there is at least 
one pointin W at which the value of f is equal to the 
average of f over W ; that is, the integral of f over W 
divided by the volume of W . (Compare this with the 
mean-value theorem for double integrals.) W hat happens 
if W is not connected? 


33. Prove: KI F (u) du] dt = falx — u)F (u) du. 


2 pz p2 
36. J / f yz?dx dy dz 
1 J1 Jisy 


37. Write the iterated integral f is T f(x, y, z) dz dy dx 
as an integral over a region in R3 and then rewrite it in 
five other possible orders of integration. 
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Tne Change of Variables 
Formula and Applications 
of Integration 


If you are stuck in a calculus problem and don't know what else to do, try integrating by 


parts or changing variables. —Jerry Kazdan 


God does not care about our mathematical difficulties. He integrates empirically. 
—Albert Einstein 


The change of variables formula is one of the most powerful integration 
methods in single-variable calculus; it enables us to evaluate integrals 


such as 


1 2 
f xe” dx 
0 


by using the substitution, or change of variables u = x*, which reduces 
the problem to the easy task of integrating e“ with respect to u. In this 
chapter, we develop the multidimensional change of variables formula, 
which is especially important and useful in evaluating multiple integrals 
in polar, cylindrical, and spherical coordinates. 

One of the key ingredients in the change of variables formula is how to 
change variables in several dimensions. This involves the notion of map- 
ping, which occurs in various interesting situations. For example, consider 
a deforming object, such as a swimming fish. As it changes its shape, one 
can imagine the instantaneous correspondence between points on the 
fish in its rest state and in its current shape. This type of correspondence is, 
in fact, the main idea behind a change of variables, in this case, of 
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one three-dimensional region (the fish in its rest state) to another (the fish 
in its current shape). 

The first section in this chapter describes the key concepts for 
mappings between regions of the plane. It goes on to develop the 
change of variables technique for double and then triple integrals. The 
chapter also includes some of the important physical applications of 
the integral. 


6.1 The Geometry of Maps from R? to R? 


figure 6.1.1 A function T froma 
region D* to a disk D. 


example | 


solution 


In this section, we shall be interested in maps from subsets of R? to R?. The resulting 
geometric understanding will be useful in the next section, when we discuss the change 
of variables formula for multiple integrals. 


Maps of One Region to Another 


Let D* be a subset of IR?; suppose we consider a continuously differentiable map 
T:D* — R?, so T takes points in D* to points in R?. We denote the set of image 
points by D or by T(D*); hence, D = T(D*) is the set of all points (x, y) € R? such 
that 


(x,y) =T(x*,y*)  forsome  (x*,y*) e D*. 


One way to understand the geometry of a map T is to see how it deforms or changes 
D*. For example, Figure 6.1.1 illustrates a map T that takes a slightly twisted region 
into a disk. 


a ° 


Let D* c R? be the rectangle D* = [0, 1] x [0, 27]. Then all points in D* are of the 
form (r, 0), whereO <r < 1,0 < @ < 2x.LetT be the polar coordinate “change of 
variables” defined by T (r, 6) = (r cos@,r sin 0). Find the image set D . 


Let (x, y) = (r cosé,r sin@). Because of the identity x? + y? = r? cos? 0 +r? sin? 0 = 
r? < 1, we see that the set of points (x, y) € R? such that (x, y) € D has the property 
that x? + y? < 1, and so D is contained in the unit disk. In addition, any point (x, y) in 


6.1 The Geometry of Maps from R? to R? 309 


the unit disk can be written as (r cos8, r sing) for some 0 <r < landO < 0 < 2r. 
Thus, D isthe unit disk (see Figure 6.1.2). 


0 y 


T: (r, 0) — (r cos 9, r sin 8) = (x, y) 


figure 6.1.2 T gives a change of variables between Euclidean and polar coordinates. The 
unit circle is the image of a rectangle. A 


example 2 | LetT be defined by T(x, y) = ((x + y)/2, (x — y)/2) and let D* = [—1, 1] x 
[—1, 1] c R? bea square with side of length 2 centered at the origin. Determine the 
image D obtained by applying T to D*. 


solution Let us first determine the effect of T on the line q(t) = (t, 1), where -1 <t < 1 
(see Figure 6.1.3). We have T(q(t)) = ((t + 1)/2, (t — 1)/2). The map t => T 
(q(t)) is a parametrization of the line y = x — 1,0 < x < 1, because (t — 1)/ 
2 = (t + 1)/2 — 1. This is the straight line segment joining (1, 0) and (0, —1). 


C1, 1) 


figure 6.1.3 Domain for the transformation T of 
u Example 2. 


Gq 
(-1, -1) 
Let 
&(t) = (1,t), -1 <t<1 
q(t) = (t, —1), =le<t=1 


be parametrizations of the other edges of the square D *. Using the same argument as 
before, we see that T o © is a parametrization of the line y = 1—x,0 < x < 1 
[the straight line segment joining (0, 1) and (1, 0)]; T o & istheline y =x +1, —1 < 
x < 0 joining (0,1) and (—1, 0); and T oq is the line y = —x —1,-1 <x <0 
joining (—1, 0) and (0, —1). By this time it seems reasonable to guess that T “flips” 
the square D* over and takes it to the square D whose vertices are (1, 0), (0, 1), 
(—1, 0), (0, —1) (Figure 6.1.4). 
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figure 6.1.4 The effect of T on the 
region D*. 


T1, -1) , TQ, 1) =(, 0) 


To provethatthisisindeed the case, let—1 < aw < landletL, (Figure6.1.3) beafixed 
line parametrized by dt) = (a, t), -1 < t < 1; then T (qt)) = ((a +t) /2, (a —t)/2) 
is a parametrization of the line y = —x+a,(a—1)/2 < x < (a@+4+1)/2. This 
line begins, for t= — 1, at the point ((œ — 1)/2, (1 +&«)/2) and ends at the point 
((1+a)/2, (æ —1)/2); as is easily checked, these points lie on the lines T o & and 
T o G, respectively. Thus, as œ varies between —1 and 1, L, sweeps out the square D * 
while T (L.,) sweeps out the square D determined by the vertices (—1, 0), (0, 1), (1, 0), 
and (0, —1). A 


Images of Maps 
The following theorem is a useful way to describe the image T (D *). 


Theorem | Let A bea2 x 2 matrix with det A + 0 and let T be the linear 
mapping of R? to R? given by T (x) = Ax(matrix multiplication). Then T trans- 
forms parallelograms into parallelograms and vertices into vertices. M oreover, if 
T (D*) isa parallelogram, D* must be a parallelogram. 


The proof of Theorem 1 is left as Exercises 14 and 16 at the end of this section. This 
theorem simplifies the result of Example 2, because we need only find the vertices of 
T (D*) and then connect them by straight lines. 


One-to-One Maps 


Although we cannot visualize the graph of a function T : R? —> R?, it does help to con- 
sider how the function deforms subsets. H owever, simply looking at these deformations 
does not give us a complete picture of the behavior of T. We may characterize T further 
by using the notion of a one-to-one correspondence. 


Definition A mapping T is one-to-one on D* if for (u, v) and (u’, v’) e D*, 
T (u, v) = T (u’, v’) implies that u = u’ and v = v’. 


This statement means that two different points of D * are not sent into the same point 
of D by T . For example, thefunction T (x, y) = (x? + y?, y*) is not one-to-one, because 
T(1,—1) = (2, 1) = T (1, 1), and yet (1, —1) ¥ (1, 1). 


example 3 


solution 


example 4 


solution 
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Consider the polar coordinate mapping function T : R? — R? described in Example 1, 
defined by T(r, 6) = (r cos6,r sin@). Show that T is not one-to-one if its domain is 
all of R?. 


If 0; Æ 62, then T (0, 6;) = T (0, 62), and so T cannot be one-to-one. This observation 
implies that if L is the side of the rectangle D* = [0, 1] x [0, 27] where 0 < 6 < 2x 
andr = 0 (Figure 6.1.5), then T maps all of L into a single point, the center of the 
unit disk D. However, if we consider the set S* = (0, 1] x [0, 2x), then T : S* — S is 
one-to-one (see Exercise 5). Evidently, in determining whether a function is one-to-one, 
the domain chosen must be carefully considered. 


6 y 
ae TL) 
. 
L 7 7 >` 
D 
X 
1 
r 
1 


figure 6.1.5 The polar coordinate transformation T takes the line L to the 
point (0, 0). A 


Show that the function T : R? —> R? of Example 2 is one-to-one. 


Suppose T (x, y) = T (x’, y’); then 


x+y x-y B x+y x-y 
2 ' 2 = 2 °° 2 


and we have 
x+y=xX +y’, 
X-y=x'-y’. 
Adding, we have 
2X = 2x’. 


Thus, x = x’ and, similarly, subtracting gives y = y’, which shows that T is one-to-one 
(with domain all of R?). Actually, because T is linear and T (x) = Ax, where A is a 
2 x 2 matrix, it would also suffice to show that det A 40 (see Exercise 12). A 


Onto Maps 


In Examples 1 and 2, we have been determining the image D = T (D *) of a region D* 
under a mapping T. What will be of interest to us in the next section is, in part, the 
inverse problem: Namely, given D and a one-to-one mapping T of R? to R?, find D* 
such that T (D*) =D. 
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example 5 


example 6 


solution 


example 7 


solution 


figure 6.1.6 We seek a region 
D* in the 0r plane whose 
image under the polar 
coordinate mapping is D. 


Before we examine this question in more detail, we introduce the notion of “onto.” 


Definition The mapping T is onto D if for every point (x, y) € D there exists 
at least one point (u, v) in the domain of T such that T (u, v) = (x, y). 


Thus, if T is onto, we can solve the equation T (u, v) = (x, y) for (u, v), given 
(x,y) €e D.IfT is, in addition, one-to-one, this solution is unique. 

For linear mappings T of R? to R? (or R” to R”) it turns out that one-to-one and onto 
are equivalent notions (see Exercises 12 and 13). 

If we are given a region D and a mapping T , the determination of a region D * such 
that T (D*) = D will be possible only when for every (x, y) e D thereisa(u, v) inthe 
domain of T such that T (u, v) = (x, y) (that is, T must be onto D). The next example 
shows that this cannot always be done. 


LetT : R? — R? begiven by T (u, v) = (u, 0).Let D bethesquare, D = [0, 1] x [0, 1]. 
BecauseT takes all of R? to oneaxis, itis impossible to finda D* such thatT (D*) = D. 
A 


Let us revisit Example 2 using these methods. 


LetT bedefined asinExample2 and let D bethe square whose vertices are(1, 0), (0, 1), 
(—1, 0), (0, —1). Finda D* with T(D*) =D. 


BecauseT is linear and T (x) = Ax, where A is a2 x 2 matrix satisfying det A 4 0, we 
know that T : IR? —> R? is onto (see Exercises 12 and 13), and thus D * can be found. By 
Theorem 1, D* must be a parallelogram. In order to find D *, it suffices to find the four 
points that are mapped onto vertices of D ; then, by connecting these points, we will have 
found D*. For the vertex (1, 0) of D, we must solve T (x, y) = (1, 0) = ((x + y)/2, 
(x — y)/2), so that (x + y)/2 = 1, (x — y)/2 = 0. Thus, (x, y) = (1, 1) is a vertex 
of D*. Solving for the other vertices, we find that D* = [—1, 1] x [—1, 1]. This isin 
agreement with what we found more laboriously inExample2. A 


Let D bethe region in the first quadrant lying between the arcs of the circles x? + y? = 
a*, X? + y? = b*,0 < a < b (see Figure 6.1.6). These circles have equationsr = a 
andr = b in polar coordinates. Let T be the polar-coordinate transformation given by 
T(r, 0) = (r cosé@,r sin) = (x, y). Find D* such thatT (D*) = D. 


In the region D,a? < x? + y? < b?; and becauser? = x? + y?, weseethata <r <b. 
Clearly, for this region 6 varies between 0 < 6 < x/2. Thus, if D* = [a, b] x [0, 2/2], 
we have T (D*) = D and T is one-to-one. A 


remark The inverse function theorem discussed in Section 3.5 is relevant to the 
material here. It states that if the determinant of DT (Uo, vo) [which is the matrix of 
partial derivatives of T evaluated at (Uo, vo)] is not zero, then for (u, v) near (Uo, vo) 
and (x, y) near (Xo, Yo) = T (Uo, vo), the equation T (u, v) = (x, y) can be uniquely 
solved for (u, v) as functions of (x, y). In particular, by uniqueness, T is one-to-one 
near (Uo, vo); also, T is onto a neighborhood of (Xo, Yo), because T (u, v) = (x, y) is 
solvable for (u, v) if (x, y) is near (Xo, Yo). 
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However, even if T is one-to-one near every point, and also onto, T need not be 
globally one-to-one. Thus, we must exercise caution (see Exercise 17). 


Surprisingly, if D* and D are elementary regions and T : D* —> D has the property 
that the determinant of DT (u, v) is not zero for any (u, v) in D*, and if T maps the 
boundary of D* in a one-to-one and onto manner to the boundary of D, then T is 
one-to-one and onto from D* to D. (This proof is beyond the scope of this text.) 


In summary, we have: 


One-to-One and Onto Mappings A mapping T: D* —> D is one-to-one 
when it maps distinct points to distinct points. It is onto when the image of D* 


under T is all of D. 


A linear transformation of R” to R” given by multiplication by a matrix A is 
one-to-one and onto when and only when det A + 0. 


exercises 


L Determine if the following functions T : R? > R? are 


one-to-one and/or onto. 


(a) T(x, y) = (2x, y) 

(b) T(x, y) = (x, y) 

(c) T(x, y) = (7%, YY) 
(d) T(x, y) = (sin x, cosy) 


Determine if the following functions T : R? + R? are 
one-to-one and/or onto. 


(a) T(x, y, z) = (2x + y + 3z, 3y — 4z, 5x) 
(b) T(x, y, z) = (y sinx, zcosy, xy) 

(c) T(x, y,z) = (xy, yz, Xz) 

(d) T(x, y,z) = (e%, eY, e) 


Let D be a square with vertices 

(0, 0), (1, 1), (2, 0), (1, —1) and D* bea parallelogram 
with vertices (0, 0), (1, 2), (2, 1), (1, —1). Find a linear 
map T taking D* onto D. 


Let D bea parallelogram with vertices 

(0, 0), (—1, 3), (—2, 0), (—1, —3). Let 

D* =[0, 1] x [0, 1]. Find alinear map T such that 
T(D*) =D. 


Let S* = (0, 1] x [0, 27) and define 
T(r, 0) = (r cosé,r sing). Determine the image set S. 
Show that T is one-to-one on S*. 


6. Define 


roe = (7 =y X z) 


J A 


Show that T rotates the unit square, D* = [0, 1] x [0, 1]. 


» Let D* =[0, 1] x [0, 1] and define T on D* by 


T (u, v) = (—u? + 4u, v). Find the image D. Is T 
one-to-one? 


Let D* be the parallelogram bounded by the lines 

y = 3x — 4, y = 3x, y = dx, and y = 4(x +4). Let 

D =[0, 1] x [0,1]. Find aT such that D is the image of 
D* under T. 


Let D* =[0, 1] x [0, 1] and define T on D* by 

T (x*, y*) = (x*y*, x*). Determine the image set D . Is 
T one-to-one? If not, can we eliminate some subset of 
D* so that on the remainder T is one-to-one? 


Let D* be the parallelogram with vertices at 

(—1, 3), (0, 0), (2, —1), and (1, 2), and D be the 
rectangle D = [0, 1] x [0, 1]. Find aT such that D is 
the image set of D* under T . 


Let T: R? — R? be the spherical coordinate mapping 
defined by (p, ġ, 0) +> (xX, y, z), where 


X=psingcosé, y=psingsindé, z=pcos®@. 
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Let D* be the set of points (p, ø, @) such that T one-to-one? If not, can we eliminate some subset of 
o €[0, x], 6 € [0, 27], p €[0, 1]. Find D = T(D*). Is D* so that, on the remainder, T will be one-to-one? 


In Exercises 12 and 13, let T(x) = Ax where A is a 2 x 2 matrix. 


12. Show that T is one-to-one if and only if the determinant 15. A mapT : R? > R? is called affineif T(x) = Ax+v, 


of A is not zero. 


where A is a2 x 2 matrix, and vis a fixed vector 
in R2. Show that Exercises 12, 13, and 14 hold for T. 


13. Show that det A + 0 if and only if T is onto. 


16. Suppose T: R? — R? is as in Exercise 14 and that 


1A. Suppose T: R? — R? is linear and is given by T (x) = T(P*) = P isaparallelogram. Show that P * is 
Ax, where A is a2 x 2 matrix. Show thatif det A 4 0, a parallelogram. 
then T takes parallelograms onto parallelograms. [HINT: 
The general parallelogram in IR? can be described by the 17. Show that T is not one-to-one. 


set of points q= p+Av+ uwfor à, u € (0, 1) where 
p, v, ware vectors in R? with vnotascalar multiple of w] 
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figure 6.2.1 The map T: (u, v) => 
(=u + 4u, v) takes the square D* 
onto the rectangle D. 


Given two regions D and D* in R?, a differentiable map T on D* with image D — that 
is, T(D*) = D—and any real-valued integrable function f: D —> R, we would like to 
express ffo f(x, y) dA as an integral over D * of the composite function f oT. In this 
section we shall see how to do this. 

Assume that D* is a region in the uv plane and that D is a region in the xy plane. 
The map T is given by two coordinate functions: 


T (u, v) = (x(u, v), y(u, v)) for (u,v) e D*. 


At first, we might conjecture that 


1 f(x, y) dxdy 2f f(x(u, v), y(u, v)) du dv, (1) 
D D* 


where f o T(u, v) = f(x(u, v), y(u, v)) is the composite function defined on D*. 
However, if we consider the function f: D —> R? where f (x, y) = 1, then equation (1) 


would imply 
ao) = ff dx dy 2f du dv = A(D*). (2) 


But equation (2) will hold for only a few special cases and not for a general map T. 
For example, define T by T (u, v) = (—u? + 4u, v). Restrict T to the unit square; that 
is, to the region D* = [0, 1] x [0, 1] in the uv plane (see Figure 6.2.1). Then, as in 


u 
(1, 0) (3, 0) 


example | 
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Exercise 3, Section 6.1, T takes D* onto D = [0, 3] x [0, 1]. Clearly, A(D) 4 A(D*), 
and so formula (2) is not valid. 


Jacobian Determinants 


To rectify the incorrect formula (1), we need a measure of how a transformation 
T : R? — R? distorts the area of a region. This is given by the Jacobian determinant, 
which is defined as follows. 


Definition Jacobian Determinant Le T: D* c R? > R? bea C! trans- 
formation given by x = x(u, v) and y = y(u, v). The Jacobian determinant of 
T, written a(x, y)/a(u, v), is the determinant of the derivative matrix DT (u, v) 
of T: 


OX OX 
A(x, y) _Jau ðv 
a(u,v) jay ay’ 

ðu av 


The function from R? to R? that transforms polar coordinates into Cartesian coordinates 
is given by 


X =I cosd, y=rsing 
and its J acobian determinant is 


a(x, y) 


cos@ -=r sing 
sind r cos 


2 +2 
=r (cos 0 sin 0) =r. 
Jlr, 0) ( + ) 


A 


Under suitable restrictions on the function T , we will argue below that the area of 
D = T(D*) is obtained by integrating the absolute value of the] acobian a(x, y)/a(u, v) 
over D *; that is, we have the equation 


ao) = ff dxdy =), 


To illustrate: From Example 1 in Section 6.1, take T : D* + D, where D =T(D*) 
is the set of (x, y) with x? + y? < land D* = [0,1] x [0,2z], and T(r,6) = 


(r cosé,r sing). By formula (3), 
A(X, y) 7 
weer ar do = ff rar do (4) 


sor- ff 


(herer and 6 play the role of u and v). From the preceding computation it follows that 


Qn pl 2x pp2 1 1 f7 
J rar do = | f rarao= f z do=5 f j= 
D* 0 0 0 2 Jo 2 Jo 


a(x, y 


Ua du dv. (3) 


) 
) 
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figure 6.2.2 The effect of the 
transformation T on a small 
rectangle D*. 


T (Ug, v9) +7 (D*) 


À 4 9 


(Xo Yo) il 
=T (Uo, vo) OX i NG 
Au Ju I + Au ay J 


OX; vi 
Ava i tA] 


is the area of the unit disk D, confirming formula (3) in this case. In fact, we can recall 
from first-year calculus that equation (4) is the correct formula for the area of a region 
in polar coordinates. 

Itis not so easy to rigorously prove assertion (3). However, looked at in the proper 
way, it becomes quite plausible. Recall that A(D) = ffa dx dy was obtained by dividing 
up D into little rectangles, summing their areas, and then taking the limit of this sum as 
the size of the subrectangles tended to zero. The problem is that T may map rectangles 
into regions whose area is not easy to compute. The solution is to approximate these 
images by simpler regions whose area we can compute. A useful tool for doing this is 
the derivative of T , which we know (from Chapter 2) gives the best linear approximation 
toT. 

Consider a small rectangle D* in the uv plane as shown in Figure 6.2.2. Let T’ 
denote the derivative of T evaluated at (Uo, vo), SOT’ iSa2 x 2 matrix. From our work 
in Chapter 2, we know that a good approximation to T (u, v) is given by 


,/ Au 
T (Ug, vo) +T Gea) 


where Au = u — Ug and Av = v — vo. This mapping T’ takes D * into a parallelogram 
with vertex at T (Uo, vo) and with adjacent sides given by the vectors 


OX Ox OX 
5 ou dav Au ðu 
i = = =A T 
T'(Aui) ay ay | 0 | Au ay uTy 
ou dav ou 
and 
[2 =| [2 
au av} [0 ðv 
' = = =A T,, 
T'(Avj) E n | Lae “| i 
ou ðv dv 
where 
Ox. OY, ox oy 
T —i+ — and T, = —i+ — 
«= out aud rae 


are evaluated at (Uo, vo). 


figure 6.2.3 The area of the little 
rectangle Ris Au Av. The area 
of T(R) is approximately 

|aCx, Yy/aCu, v)|AU Av. 
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Recall from Section 1.3 that the area of the parallelogram with sides equal to the 
vectors ai+ bj and cì + dj is equal to the absolute value of the determinant 


a c 
b d 


a b| 
c d| 


Thus, the area of T (D*) is approximately equal to the absolute value of 


ðX Aù SN ox Ox 

—= e v — — 

ðu w OU. laar EN A 
ay ay əy ay a(u, v) 

SLAD Ab) e 

ou dv ðu ðv 


evaluated at (uo, vo). 

This fact and a partitioning argument should make formula (3) plausible. Indeed, 
if we partition D * into small rectangles with sides of length Au and Av, the images 
of these rectangles are approximated by parallelograms with sides T, Au and T, Av, 
and hence with area |a(x, y)/d(u, v)| Au Av. Thus, the area of D* is approximately 
X Au Av, where the sum is taken over all the rectangles R inside D * (see Figure 6.2.3). 
Hence, the area of T (D *) is approximately the sum 5- |a(x, y)/a(u, v)|Au Av. In the 
limit, this sum becomes 


L et us give another informal argument for the special case (4) of formula (3); that is, 
the case of polar coordinates. Consider aregion D inthe xy plane and a grid correspond- 
ing to a partition of ther and 6 variables (Figure 6.2.4). The area of the shaded region 
shown is approximately (Ar )(r jk A@), because the arc length of a segment of a circle 
of radiusr subtending an angle ¢ isr œ. The total area is then the limit of $- r jk Ar Ad; 
that is, ff. dr do. The key idea is thus that the jkth “polar rectangle” in the grid 
has area approximately equal to rj, Ar A@. (For n large, the j kth polar rectangle will 
look like a rectangle with sides of lengths rj, A@ and Ar.) This should provide some 
insight into why we say the “ area element dx dy” is transformed into the “ area element 
rdr d8.” 


du dv. 


a(x, y) 
a(u, v) 
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figure 6.2.4 D* is mapped to D 
under the polar coordinate 


mapping T. 


example 2 | Lettheelementary region D inthe xy plane be bounded by the graph of a polar equation 
r = f(0), where 6 < 6 < 6, and f (0) > 0 (see Figure 6.2.5). In the r@ plane we 
consider the r-simple region D*, where &) < 6 < 6 andO <r < f(@). Under the 
transformation x =r cosé, y =r sin 8, the region D * is carried onto the region D. Use 
equation (4) to calculate the area of D. 


0 


figure 6.2.5 The effect on the region D* of the polar coordinate mapping. 
solution 
a(x, y) 


aio) = ff ixdy = ff alr, 0) 
sj arao= f [f rar dé 
-f J = f o 


This formula for A(D ) should be familiar from one-variable calculus. A 


farag 


Change of Variables Formula 


Before stating the two-variable change of variables formula, which is the culmination 
of this discussion, let us recall the corresponding theorem from one-variable calculus 
that goes under the name the method of substitution: 


b dx x(b) 
f town pau f (x) dx, (5) 


(a) 


where f is continuous and u +> x(u) is continuously differentiable on [a, b]. 
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proof LetF beanantiderivativeof f;thatis, F’ = f, whose existence is guaranteed 
by the fundamental theorem of calculus. The right-hand side of equation (5) becomes 


x(b) 
1 f(x)dx = F (x(b)) — F (x(a)). 


(a) 


To evaluate the left-hand side of equation (5), let G (u) = F (x(u)). By the chain rule, 
G'(u) = F’(x(u))x’(u) = f(x(u))x’(u). Hence, again by the fundamental theorem, 


b b 
f fxw)x'u) du = f G'(u) du = G (b) — G (a) = F (x(b)) — F (x(a), 
as required. E 


Suppose now that we have a C! function u +> x(u) that is one-to-one on [a, b]. 
Thus, we must have either dx/ du > 0 on [a, b] ordx/du < 0 on [a, b].+ Let | * denote 
the interval [a, b], and let | denote the closed interval with endpoints x(a) and x(b). 
(Thus, | = [x(a), x(b)] if u —> x(u) is increasing and | = [x(b), x(a)] ifu re x(u) is 
decreasing.) With these conventions we can rewrite formula (5) as 


[foun du = f f(x). 


This formula generalizes to double integrals, as was already given informally in formula 
(3): |* becomes D*, | becomes D, and |dx/du| is replaced by |a(x, y)/a(u, v)|. Let 
us state the result formally (the technical proof is omitted). 


dx 
du 


Theorem 2 Change of Variables: Double Integrals Let D and D* be 
elementary regions in the plane and let T: D* —> D be of class C 1; suppose that 
T is one-to-one on D*. Furthermore, suppose that D = T(D*). Then for any 
integrable function f: D —> R, we have 


2 A(X, y) 
J Fox y)dedy = ff ttxtu, v), ytu, o) 7%) dude, (6) 


One of the purposes of the change of variables theorem is to supply a method 
by which some double integrals can be simplified. We might encounter an integral 
JJ f dA for which either the integrand f or the region D is complicated and for which 
direct evaluation is difficult. Therefore, a mapping T is chosen so that the integral is 
easier to evaluate with the new integrand f oT and with the new region D * [defined by 
T(D*) = D]. Unfortunately, the problem may actually become more complicated if T 
is not selected carefully. 


lf dx/du is positive and then negative, the function x = x(u) rises and then falls, and thus is not 
one-to-one; a similar statement applies if dx/ du is negative and then positive. 
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example 3 


solution 


Let P be the parallelogram bounded by y = 2x, y = 2x — 2, y = x, andy =x +1 
(see Figure 6.2.6). Evaluate ff, xy dx dy by making the change of variables 


X=U-v, y = 2u — v, 
thatis, T (u, v) = (u — v, 2u — v). 


figure 6.2.6 The effect of T (u, v) = (u— v, 2u— v) on the rectangle P*. 


The transformation T has nonzero determinant and so is one-to-one (see Exercise 12, 

Section 6.1). It is designed so that it takes the rectangle P * bounded by v = 0, v = 

—2, u =0,u = l onto P . The use of T simplifies the region of integration from P to 
Br y) 


P *, M oreover, 
1 —1 
= | det =1. 
ael l 


Therefore, by the change of variables formula, 


0 1 
[f vexay= ff (u~v)(2u —v) dudv = f J (tu? = 30 +02) dude 
P p* -2 J0 
0 2 1 0 
= 
2 


3u?v 2 3 
£ 3 Uv 2 = £9 2 
E 5 +ou) de LE Zoto] do 


Integrals in Polar Coordinates 


Suppose we consider the rectangle D * defined by 0 < 6 < 2m,0 <r <a inthero 
plane. The transformation T given by T (r, @) = (r cos, r sin 8) takes D * onto the disk 
D with equation x? + y? < a? in the xy plane. This transformation represents the 
change from Cartesian coordinates to polar coordinates. However, T does not satisfy 
the requirements of the change of variables theorem, because it is not one-to-one on 
D*: In particular, T sends all points with r = 0 to (0, 0) (see Figure 6.2.7 and Example 
3 of Section 6.1). Nevertheless, the change of variables theorem is valid in this case. 


figure 6.2.7 The image of the 
rectangle D* under the polar 
coordinate transformation is the 
disk D. 


example 4 


solution 
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x2 +y? < a2 


Basically, the reason for this is that the set of points where T is not one-to-one lies on 
an edge of D*, which is the graph of a smooth curve and therefore, for the purpose of 
integration, can be neglected. In summary, the formula is valid when T sends D* onto 
D in a one-to-one fashion, except possibly for points on the boundary of D *. 


Change of Variables—Polar Coordinates 


II F(x, yaxdy= ff flr coso, r sine)r dr do (7) 


Evaluate ff log (x? + y?) dxdy, where D is the region in the first quadrant lying be- 
tween the arcs of the circles x? + y? = a? and x? + y? = b?, where0 <a <b 


(Figure 6.2.8). 
y 
t 
-a 
r X 
(a, 0) (b, 0) 


figure 6.2.8 The polar coordinate mapping takes a rectangle D* onto part of 
an annulus D. 


0 


These circles have the simple equationsr = a andr = b in polar coordinates. M oreover, 
r? = x? + y? appears in the integrand. Thus, a change to polar coordinates will simplify 
both the integrand and the region of integration. From Example 7, Section 6.1, the polar 
coordinate transformation 


X =T cosd, y =r sin 
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example 5 


sends the rectangle D* given by a <r < b,0 < @ < x/2 onto the region D. This 
transformation is one-to-one on D* and so, by formula (7), we have 


b px /2 
J| toate? + yay J Í rlogr?dédr 
D a 0 


b b 
= 2gr — 7 
= 5 rlogr“dr | 2r logr dr. 


Applying integration by parts, or using the formula 


x? X 
J ¥logx ax = 5 lgx- > 


from the table of integrals at the back of the book, we obtain the result 


x f Ea 2 l2 2 
=f 2r logr dr = 3 [b logb — a“ loga — 5(b —a‘)}. i 


The Gaussian Integral One of the most beautiful applications of the change of 
variables formula, polar coordinates, and the reduction to iterated integrals is their 
application to the following formula, known as the Gaussian integral: 


J e% dx = V7. 


Not only is this formula very attractive in its own right, but itis also useful in areas such 
as statistics. It also illustrates the unity of the transcendental numbers e and z nearly as 
well as does the classic formula e'” = —1. 

To carry out the integration of the Gaussian integral,? we first evaluate the double 


integral 
i an (X?+y7) dx dy, 
Da 


where D, is the disk x? + y? < a?. Becauser? = x? + y?, and dxdy = r dr d9, the 
change of variables formula gives 


2 2 ax a 2 2x 1 2 
I ety ‘ixdy = | | er dr d8 = (-5e" ) 
Da 0 0 0 2 
1 27 


== | (e* —1)d@=n(l—e*). 
2 Jo 


a 


do 


0 


2The method that follows is admittedly not straightforward but requires a trick. The trick is to start with 
the desired formula and square both sides. You will then observe that the left-hand side resembles an 
iterated integral. There are several other ways to evaluate the Gaussian integral, but all of them require 
some nonobvious method. For the use of complex variables to evaluate it, see, for example, J. M arsden 
and M . Hoffman, Basic Complex Analysis, 3rd ed., W. H. Freeman, New York, 1998. 
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If weleta — oo in this expression, we give meaning to the improper integral and get 


// eV dx dy = r. 
R2 


Assuming (as shown in the Internet supplement) that we can also evaluate this improper 
integral as the limit of the integrals over the rectangles Rg = [—a, a] x [—a, a] as 


a — oo, we get 
lim II eh "dx dy = r. 
a—oo Ra 


By reduction to iterated integrals, we can write this as 


a i a ; a : 2 
lim J e* ax f ey ay = iim f e™* ax =x. 
a> —a —a a—oo —a 
oo 2 
| ean] =n. 


Thus, taking square roots, we arrive at the desired result. 
Here is a variant of the Gaussian integral. Evaluate 


i 2 
J ex. 
—CO 


To do this, use the change of variables formula y = /2x to reduce the problem to the 
Gaussian integral just computed: 


o0 /2a 
fe eX? dx = im f'e e72? oot | eo" s 
eY dy = — s/m = 
-5f y avi V5 


Change of Variables Formula for Triple Integrals 


To state this formula, we first define the J acobian of a transformation from R? to R?— it 
is a simple extension of the two-variable case. 


That is, 


Definition Let T: W c R? —> R? be aC? function defined by the equations 
x = x(u, v, W), y = y(u, v, W), Z = z(u, v, W). Then the J acobian of T , which 
is denoted a(x, y, Z)/a(u, v, w), is the determinant 


ox əx ay 
ðu ðv Ww 
əy dy dy 
ðu ðv aw]. 
oz OL 0z 
ðu ðv OW 
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The absolute value of this determinant is equal to the volume of the parallelepiped 
determined by the three vectors 


əx. dy. az 
Th =< ie je 
“OU Fault au 
ax. dy. az 
Ta ie i 
ð talta k 
ax _ az 
Ty = i ôy j 2 k. 
ow ow ow 


J ust as inthe two-variable case, the] acobian measures how the transformation T distorts 
the volume of its domain. Hence, for volume (triple) integrals, the change of variables 
formula takes the following form: 


Change of Variables Formula: Triple Integrals 


// f(x, y, z) dxdy dz 
Ww 
A(X, y, Z) 


= i f(x(u, v, w), y(u, v, W), z(u, v, W)) Ge 
we 


ear dududw, (8) 


where W * isan elementary region in uvw space corresponding to W in xyz space, 
under a mapping T: (u, v,w) + (x(u, v, Ww), y(u, v, w), z(u, v, W)), provided 
T is of class C t and is one-to-one, except possibly on a set that is the union of 
graphs of functions of two variables. 


Cylindrical Coordinates 


Let us apply formula (8) to cylindrical and then to spherical coordinates. First, we com- 
pute the J acobian for the map defining the change to cylindrical coordinates. Because 


X =r cosd, y =r sinð, Z=Z, 
we have 
cos9 -rsind 0 


sing rcos@ 0 
0 0 1 


=f. 


Thus, we obtain the formula 


Change of Variables—Cylindrical Coordinates 


// F(x, y,2) dxdydz = ff f(r cosé,rsing,z)rdrdoedz. (9) 
W W* 


example 6 


6.2 The Change of Variables Theorem 


Spherical Coordinates 


N ext we consider the spherical coordinate system. Recall that it is given by 


x = psingcosd, y = psingsind, Zz = pcos. 
Therefore, we have 
a(x, y sing@cosé —psingsind pcosg coso 
Jl T =I|sindsind psindcosd pcos@sinéd|. 
p, 6, ) coso 0 —psing 
Expanding along the last row, we get 
A(X, y, Z) ae —psingsind pcoso coso 
ale, op) psingcosð pcos@sing 
ana Singcosð —posingsine 
SIN! Singsing  psing cose 


= — p? cos? ọġ sin ¢ sin? 6 — p? cos? ẹ sin ¢ cos? 6 
— p° sin? ¢ cos? 6 — p? sin? ¢ sin? 0 
= — p cos’ p sing — p*sin? o = -p° sino. 


Thus, we arrive at the formula: 
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III f(x, y, z) dxdy dz 


=) f(psindcosé, psingdsing, pcosd)p*sinddodod@. (10) 
W* 


To prove formula (10), we must show that the transformation S on the set W * is one- 
to-one except on a set that is the union of finitely many graphs of continuous functions. 


We shall leave this verification as Exercise 34. 


Evaluate 


II exp (x? + y? +.22)3/ dV, 
w 


where W is the unit ball in R3. 
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solution First note that we cannot easily integrate this function using iterated integrals (try it!). 
Hence (employing the strategy in the quote that opened this chapter), let us try a change 
of variables. The transformation into spherical coordinates seems appropriate, because 
then the entire quantity x? + y? + z? can be replaced by one variable, namely, p°. If W * 
is the region such that 


we can apply formula (10) and write 


II exp (x? + y? +27) dV = /// pee” sinddpdodd. 
W * 


This integral equals the iterated integral 


L m 2x 1 x 
II] e’ singdodgdo=2r | | ep? singdede 
0 0 0 0 0 


1 
= -2x | pre’ [coso] dp 
0 


example 7 Let W be the ball of radius R and center (0, 0, 0) in R3. Find the volume of W. 


solution | The volume of W is {ff dxdydz. This integral may be evaluated by reducing it to 
iterated integrals or by regarding W as a volume of revolution, but let us evaluate it here 
by using spherical coordinates. We get 


2x 2x 
fff awe- f [fe 2 sing dod do = — S] sing dodo 


3 
rt sing do = Zi- [cos(z) — cos(0)]} = ae 


3 2.3 


which is the standard formula for the volume of a solid sphere. A 


exercises 


L Suggest a substitution/transformation that will simplify 2. Suggest a substitution/transformation that will simplify 
the following integrands, and find their J acobians. the following integrands, and find their J acobians. 


(a) [fs + 2y) sin(x — y) dA (a [fc ad a 
R 


(b) 1 el-4x+7Y) cos(7x — 2y) dA (b) J| xsintex +7) ~3ysin(6x + 79148 
R R 


3. Let D be the unit disk: x? + y? < 1. Evaluate 


1 exp (x? + y?) dx dy 
D 


by making a change of variables to polar coordinates. 


4. Let D bethe region 0 < y < x and 0 < x < 1. Evaluate 


[fe nenay 
D 


by making the change of variables x = u + v, 
y =u — v. Check your answer by evaluating the integral 
directly by using an iterated integral. 


5 Let T(u, v) = (x(u, v), y(u, v)) be the mapping defined 
by T (u, v) = (4u, 2u + 3v). Let D * be the rectangle 
[0, 1] x [1, 2]. Find D = T (D *) and evaluate 


(a) I xy dx dy 
D 

(b) [fo -nexay 
D 


by making a change of variables to evaluate them as 
integrals over D *. 


G Repeat Exercise 5 for T (u, v) = (u, v(1 + u)). 


7. Evaluate 


J| dxdy 
p VIFXx+?2y" 


where D = [0, 1] x [0, 1], by setting T (u, v) = (u, v/2) 
and evaluating an integral over D*, where T (D*) =D. 


& DefineT (u, v) = (u? — v2, 2uv). Let D * be the set 
of (u, v) with u? + v? < 1, u > 0, v > 0. Find 
T(D*) =D. Evaluate f fy dx dy. 


9. Let T (u, v) be as in Exercise 8. By making a change of 
variables, “formally” evaluate the “improper” integral 


I dx dy 

D yx? +y? 

[NoTE: This integral (and the one in the next exercise) is 
improper, because the integrand 1/./x2 + y? is neither 
continuous nor bounded on the domain of integration. 


(The theory of improper integrals is discussed in 
Section 6.4.)] 


10. Calculate fi. —} aydx, where R is the region 
RX+Y 


bounded by x =0,y=0,x+y=1,x+y = 4, by 
using the mapping T (u, v) = (u — Uv, Uv). 
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IL Evaluate / / (x? + y2)?/2dxdy, where D is the disk 


D 
x? +y? <4, 


12. Let D* bea v-simple region in the uv plane bounded by 
v = g(u) and v =h(u) < g(u) fora <u <b. Let 
T: R2 — R? be the transformation given by x = u and 
y = y(u, v), where w is of class C t and aw/av is never 
zero. Assume that T (D*) = D isa y-simple region; 
show that if f: D — R is continuous, then 


If Fox, y)dxay= |f Flu, ytu, o) $E | dude. 
D D* u 


13. Use double integrals to find the area inside the curve 
r =1+sinð. 


14. (a) Express i (Re xy dy dx as an integral over the 
triangle D *, which is the set of (u, v) where 
0<u <1,0 <v <u. (HINT: Find a one-to-one 
mapping T of D* onto the given region of 
integration.) 


Evaluate this integral directly and as an integral 
over D*. 


(b 


15. Integrate ze**+Y? over the cylinder 
xX? +y? 24,2 27 <3. 


16. Let D be the unit disk. Express 
(1+ x? + y2)3/2dx dy as an integral over 


D 
[0, 1] x [0, 27x] and evaluate. 


17. Using polar coordinates, find the area bounded by the 
lemniscate (x? + y?)? = 2a? (x? — y?). 


18. Redo Exercise 15 of Section 5.3 using a change of 
variables and compare the effort involved in each 
method. 


19. Calculate (x + y)2e*-Ydx dy, where R is the region 


R 
bounded by x + y = 1, x + y = 4, X —y = —1, and 
x-y=l. 

20. Let T: R? — R? be defined by 

T(u, v, w) = (u cos v cosw, u sin v cosw, u sinw). 


(a) Show that T is onto the unit sphere; that is, every 
(x, y, z) with x? + y? + z? = 1 can be written as 
(x, y,Z) =T (u, v, w) for some (u, v, w). 


(b) Show that T is not one-to-one. 


2L Integrate x? + y? + z? over the cylinder 
x? +y? <2,-2<2<3. 
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22. Evaluate [5° e2 dy, 


23. Let B be the unit ball. Evaluate 
I dx dy dz 
B y/2+x? +y? +422 


by making the appropriate change of variables. 


24. Evaluate / / [1/(x? + y?)?]dxdy, where A is 


A 
determined by the conditions x2 + y? < 1 and 
x+y>21. 


dx dy dz , 
25. Evaluate | | | pya eel is the 


solid ounde by the two spheres x2 + y? + z? = a? 
and x2 + y? + z? = b?, where 0 <b <a. 


26. Use spherical coordinates to evaluate: 


LEE” 


27. Let D bea triangle in the (x, y) plane with vertices 
(0, 0), (3, £), (1, 0). Evaluate: 


je 


by making the appropriate change of variables. 


V2 + y2 +22 


AEE dz dy dx 


28. Evaluate I x?dx dy, where D is determined by the 


D 
two conditions 0 < x < y and x? + y? < 1. 

29. Integrate \/x2 + y2 + z2 e-(*’+Y"+2") over the region 
described in Exercise 25. 


30. Evaluate the following by using cylindrical coordinates. 


(a) JII z dx dy dz, where B is the region within the 


cylinder x? + P = 1 above the xy plane and below 
the cone z = (x2 + y2)1/2 


(b) II (x? + y? +27)~/2dx dy dz, where W is the 


eon seins by the conditions 4 53 <2Z<1and 
x? +y? +z? <1 


3L Evaluate (x + y) dx dy, where B is the rectangle in 


the xy plana with vertices at (0, 1), (1, 0), (3, 4), and 
(4, 3). 


32. Evaluate I (x + y) dx dy, where D is the square with 


D 
vertices at (0, 0), (1, 2), (3, 1), and (2, —1). 


33. Let E be the ellipsoid (x?/a?) + (y?/b?) + (z*/c?) < 1, 
where a, b, and c are positive. 
(a) Find the volume of E. 
(b) Evaluate 


// [(x2/a2) + ( y2/b2) + (z2/c?)] dx dy dz. 
E 


(HINT: Change variables and then use spherical 
coordinates.) 


34. Using spherical coordinates, compute the integral of 
f (o, @, 0) = 1/p over the region in the first octant of 
R3, which is bounded by the cones ¢ = 7/4, 
¢ = arctan 2 and the sphere p = V6. 


35. The mapping T (u, v) = (u? — v2, 2uv) transforms the 
rectangle 1 < u < 2,1 < v < 3 of theuv plane into a 
region R of the xy plane. 


(a) Show that T is one-to-one. 


(b) Find the area of R using the change of variables 
formula. 


36. Let R denote the region inside x? + y? = 1, but outside 
x? +y? = 2y withx > 0, y > 0. 


(a 
(b 


Sketch this region. 


Letu = X? + y?, v = x? + y? — 2y. Sketch the 
region D in the uv plane, which corresponds to 
R under this change of coordinates. 


Compute I xe’ dx dy using this change of 
R 


coordinates. 


(c 


37. Let D bethe region bounded by x3/2 + y3/2 = a3/2, for 
x > 0, y = 0, and the coordinate axes x = 0, y = 0. 


Express I f(x, y) dx dy as an integral over the 
D 


triangle D *, which is the set of points 
0<u<a,0<v <a -— u. (Do not attempt to evaluate.) 


38. Show that S(p, 0, ¢) = (posin cos98, psingsing, 
pcos $), the spherical change-of-coordinate mapping, is 
one-to-one except on a set that is a union of finitely 
many graphs of continuous functions. 
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6.3 Applications 


In this section, we shall discuss average values, centers of mass, moments of inertia, 
and the gravitational potential as applications. 


Averages 


If Xi, ..., Xn aren numbers, their average is defined by 


Xi +---+X 
[xi lv = : T= Son 


Notice that if all the x; happen to have a common value c, then their average, of course, 
also equals c. 
This concept leads us to define the average values of functions as follows. 


Average Values Theaveragevalueof a function of one variable on the interval 


[a, b] is defined by 
i f(x) dx 
LT ly = S, 


b—a 
Likewise, for functions of two variables, the ratio of the integral to the area of D, 


PE ihe) ” 
foy 


is called the average value of f over D. Similarly, the average value of a function 
f on aregion W in 3-space is defined by 


i f(x,y, Sa 


n dx dy dz 


Again, notice that the denominator is chosen so that if f is a constant, say c, then 
[f la =c. 


example | Find the average value of f(x, y) = x sin?(xy) on the region D = [0, x] x [0, 7]. 


[fla = 


solution First, we compute 


JI f(x, y) axdy = f [ x sin?(xy) dx dy 
-f if 1—cos(2xy) | x ay i 
0 0 2 


= [fy  sin(2xy)] |7 
=| 5 E 4x | y=0 á 
0 A 2 T 
=f ES y _ E= aa 
0 2 4 4 8x 0 
m? cos(2x?) —1 
= + . 


4 82 
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example 2 


solution 


figure 6.3.1 The lever is 
balanced if © (x — X) m; = 0. 


Thus, the average value of f, by formula (1), is 


3 2) _ J= 
mx?/4 + [cos(2x4) — 1]/8x = 3% cos(2x*) — 1 zx 0.7839. A 
m? 4 8r? 


The temperature at points in the cube W = [—1, 1] x [—1, 1] x [—1, 1] is proportional 
to the square of the distance from the origin. 


(a) What is the average temperature? 
(b) At which points of the cube is the temperature equal to the average temperature? 


(a) Let c be the constant of proportionality, so T = c(x* + y? + z?) and the average 
temperature is [T ] = 3 ff fy T dx dy dz, because the volume of the cube is 8. Thus, 


c uf 1 1 
Mien F J / (x2 + y? + z?) dxdy dz. 
ee ee 


The triple integral is the sum of the integrals of x2, y?, and z*. Because x, y, and z enter 
symmetrically into the description of the cube, the three integrals will be equal, so that 


1 1 1 1 1 1 
ia Sf / J z?dx dy dz = Sf z (/ i éxdy) dz. 
8 Jaa Ja JA 8 J-i =J 


The inner integral is equal to the area of the square [—1, 1] x [—1, 1]. The area of that 


square is 4, and so 
30 fs 30 2? 
Tiga Fe d= > (5) 


(b) The temperature is equal to the average temperature at all points satisfying 
c(x? + y? +z?) =c—thatis, at all points lying on the sphere x? + y?+z7=1. A 


1 
=. 
=1 


Centers of Mass 


If masses mı, ..., Mn are placed at points x1, ..., Xn on the x axis, their center of mass 
is defined to be 


X mixi 
X= Sm; . 


This definition arises from the following observation: If one is balancing masses on a 
lever (Figure 6.3.1), the balance point X occurs where the total moment (mass times 
distance from the balance point) is zero; that is, where X` mi(x;i — X) = 0. A physical 
principle, going back first to Archimedes and then in this generality to Newton, states 
that this condition means there is no tendency for the lever to rotate. 


(2) 


Center of mass Plate 


figure 6.3.2 The plate 
balances when supported 
at its center of mass. 


example 3 


solution 
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For a continuous mass density 5(x) along the lever (measured in, say, grams/cm), 
the analogue of formula (2) is 
[x00 0 
Jo (3 


[soa . 


For two-dimensional plates, this generalizes to: 


i= 


x 


The Center of Mass of Two-Dimensional Plates 


= xô(x, y) dx dy ill yd(x, y) dx dy 7 
~ ih 5(x, y) dx dy ffs xe y) dx dy 


where again 5(x, y) is the mass density (see Figure 6.3.2). 


Find the center of mass of the rectangle [0, 1] x [0, 1] if the mass density is e**’, 


First we compute the total mass: 
1 
e*tYdxdy = tidk dy = ey]! Jdy= | (e — e) dy 
x=0 0 
elt meh, =e? ea =e —2e+1. 


The numerator in formula (4) for x is 


1 pl 1 1 
a xeMdxdy = | (xe e) |i dy = f eH — e1 — (06 — e)1dy 
0/0 0 0 
1 1 
= e'dy =e | =e-l, 
0 y=0 


e—1 e—1 1 
X= = = ~ 0.582. 
e —2e+1 (e-—1)? e-1 


so that 


Theroles of x and y may beinterchangedin all these calculations, sothaty = 1/(e—1) ~ 
0.582 aswell. A 


For a region W in space with mass density 5(x, y, z), we know that 


volume = JII dx dy dz, (5) 
mass = If 5(X, y, Z) dx dy dz. (6) 


If we denote the coordinates of the center of mass by (X, y, Z), then the generalization 
of the formulas in the preceding box are as follows. 
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example 4 


solution 


example 5 


solution 


Coordinates for the Center of Mass of Three-Dimensional 


Regions 
If Xd(x, y, Z) dx dy dz 
X= W 


mass 


= yô(x, y, Z) dx dy dz 


mass 


7 [Tl zô(x, y, z) dx dy dz 


mass 


The cube [1, 2] x [1, 2] x [1, 2] has mass density given by &(x, y,z) = (1+ x)e?y. 
Find the total mass of the box. 


The mass of the box is, by formula (6), 


1+ x)e’y dx dy dz = 2 e7 “i 
LE LE |e, 
-f| [ vye- y Dak 2=[Bel = Fie- A 


If a region and its mass density are reflection-symmetric across a plane, then the 
center of mass lies on that plane. For example, in formula (7) for x, if the region and 
mass density are symmetric in the yz plane, then the integrand is odd in x, and so X = 0. 
This kind of use of symmetry is illustrated in the next example. 


Find the center of mass of the hemispherical region W defined by the inequalities 
x? + y? +z? < 1,z > 0. (Assume that the density is unity.) 


By symmetry, the center of mass must lie on the z axis, and so X = y = 0. To find Z, 


we must compute, by formula (7), the numerator | = ff fy z dx dy dz. The hemisphere 
is an elementary region, and thus the integral becomes 


1- Jf1-22 1- J/1-y—2 J/1-y—2 
l -f F z dx dy dz. 
fis =< t= Jy? Jy? 


Because z is a constant for the x and y integrations, we can remove it from the first two 


integral signs, to obtain 
1- fiz 1- J1-y-2 J1-y-2 
=f dxdy | dz. 
-y1-2 J- /1-y2=2 


Instead of calculating the inner two integrals explicitly, we observe that they equal the 
double integral ff, dxdy over the disk x? + y? < 1 — z?, considered as an x-simple 


Historical Note 


figure 6.3.3 Equilibrium positions 
of the human body, to be 
observed by the painter. The 
center of mass should be 
supported to maintain 
equilibrium. 
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region in the plane. The area of this disk is x(1— z?), and so 


1 1 2 4 
= -Ay e eee | ae S 
L=] z(1 Pdz—x f (z Pde =n [5 =F 


The volume of the hemisphere is 37, and so Z = (7/4)/(37) =}. A 


It is common knowledge that Archimedes observed the principle of the lever. 
Perhaps less known is that he was also responsible for discovering the concepts 
of center of mass and center of gravity. Only two of his works on mechanics have 
been handed down to us: On Floating Bodies and On the Equilibrium and 
Centers of Mass of Plane Figures. Both were translated into Latin by Niccolo 
Tartaglia, circa 1543. 

In Equilibrium... , Archimedes began the field of applied mathematics, doing 
for mechanics what Euclid had accomplished for geometry. In this work he 
describes the principles behind all the machines of antiquity, including the lever, 
inclined plane, and pulley systems. 

Surprisingly, Archimedes never carefully defined the center of mass; the first 
proper definition was given by Pappus of Alexandria in 340 c.e. The concept of 
equilibrium was to have a profound effect on the development of mechanical 
engineering (through the introduction of gears), architecture, and in art, 
permitting the construction of complex machines, large-scale buildings, and 
sculptures. Figure 6.3.3 shows sketches by Leonardo DaVinci, illustrating 
equilibrium positions of the human body. 


Moments of Inertia 


Another important concept in mechanics, one that is needed in studying the dynamics 
of a rotating rigid body, is that of moment of inertia. If the solid W has uniform density 
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example 6 


solution 


figure 6.3.4 The gravitational 
potential that produces a force 
acting on a mass mat (x1, yy, 21) 
arising from the mass dM = 6 

(x, y, D dx dy az at (x, y, 2 is 
—(Gmô (x, y, D dx dy az)/r. 


ô, the moments of inertia |,, |y, and |, about the x, y, and z axes, respectively, are 
defined by: 


Moments of Inertia About the Coordinate Axes 


= ff (y? + z?) sdxdy dz, Iy = fff 0+2) sdxdye, 
w W 
i= ff (x? + y?) ô dx dy dz. 
W 


The moment of inertia measures a body’s response to efforts to rotate it; for example, 
as when one tries to rotate a merry-go-round. The moment of inertia is analogous to 
the mass of a body, which measures its response to efforts to translate it. In contrast to 
translational motion, however, the moments of inertia depend on the shape and not just 
the total mass. Itis harder to spin up a large plate than a compact ball of the same mass. 

For example, |, measures the body’s response to forces attempting to rotate it about 
the x axis. The factor y? + 22, which is the square of the distance to the x axis, weights 
masses farther away from the rotation axis more heavily. This is in agreement with the 
intuition just explained. 


(8) 


Compute the moment of inertia |, for the solid above the xy plane bounded by the 
paraboloid z = x? + y? and the cylinder x? + y? = a”, assuming a and the mass density 
to be constants. 


The paraboloid and cylinder intersect at the plane z = a2. Using cylindrical coordinates, 
we find from equation (8), 


a pdx pr? a pax pr? 6 
k= f I f ér?-rdzdédr =o [ f I r3dzdédr = ws 
o Jo Jo o Jo Jo 3 A 


Gravitational Fields of Solid Objects 


Another interesting physical application of triple integration is the determination of the 
gravitational fields of solid objects. Example 7, Section 2.6, showed that the gravitational 
force field F(x, y, z) of a particle is the negative of the gradient of a function V (x, y, z) 
called the gravitational potential. If there is a point mass M at (x, y, z), then the 
gravitational potential acting on amass m at (x1, Yı, Z1) dueto this mass is —G mM [(x— 
X1)? + (y — y1)? + (z — 21)?]-¥/?, where G is the universal gravitational constant. 

If our attracting object occupies a domain W with mass density 5(x, y, z), we 
may think of it as made of infinitesimal box-shaped regions with masses dM = 
5(x, y, Z) dx dy dz located at points (x, y, z). The total gravitational potential V for 
W is then obtained by “summing” the potentials from the infinitesimal masses. T hus, 
we arrive at the triple integral (see Figure 6.3.4): 


2 ô(x, y, Z) dx dy dz 
V (Xi, Yı zı) = cm f/f V(x =x)? (y= yi)? + (2 — 21)? (9) 


Historical Note 


example 7 


solution 


figure 6.3.5 The gravitational 
potential at (x, yı, Z1) is the 
same as at (0,0, R), where 


=4/ S+% 42. 
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The theory of gravitational force fields and gravitational potentials was 
developed by Sir Isaac Newton (1642-1727). Newton withheld publication of his 
gravitational theories for quite some time. The result that a spherical planet has 
the same gravitational field as it would have if its mass were all concentrated at 
the planet's center first appeared in his famous Philosophiae Naturalis Principia 
Mathematica, the first edition of which appeared in 1687. Using multiple integrals 
and spherical coordinates, we shall solve Newton's problem here; remarkably, 
Newton's published solution used only Euclidean geometry. 


Let W bea region of constant density and total mass M . Show that the gravitational 
potential is given by 


1 
V (Xi, Yu Z1) = B GM m, 
av 


where [1/r Jay is the average over W of 


1 


f(x, y,2) = | 
a ay Sa 


According to formula (9 


V (Xi, Yas 21) mom ff JK our a +(z—2;)? 
=e Hh V(x =x) so + (z= 2) 
th V(x = x1)? oe + (Z— z)? 


volume (W ) 


= Gml[6d volume (W 


| 
r av 


=GmM 


are required. A 


Let us now use formula (9) and spherical coordinates to find the gravitational poten- 
tial V (X1, Yı, 21) for a region W with constant density between the concentric spheres 
p = pı and p = p, assuming the density is constant. Before evaluating the integral 
in formula (9), we make some observations that will simplify the computation. B e- 
cause G, m, and the density are constants, we may ignore them at first. Because the 
attracting body, W , is symmetric with respect to rotations about the origin, the poten- 
tial V (X1, Yı, Z1) should itself be symmetric— thus, V (x1, yi, Z1) depends only on the 
distance R = \/x?+ y? + z2 from the origin. Our computation will be simplest if we 
look at the point (0, 0, R) on the z axis (see Figure 6.3.5). Thus, we need to evaluate the 


integral 
V(0, 0, R) =-/ff dx dy dz . 
VX? +y? + (z— R)? 
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In spherical coordinates, W is described by the inequalities p1 < p < p2, 0 < 0 < 2x, 
and 0 < @ < x, and so 


P2 T 2x Ie 
=V(0, 0,8) = f f f p sing do de dp 
pm Jo Jo yosin? p(cos?0 + sin? 0) + (pcoso-— R)? 


Replacing cos?@ + sin? 0 by 1, so that the integrand no longer involves 6, we may 
integrate over 0 to get 


V(0, 0, R) =r f [ aL, 
Vp? sin? o + (pcos — R)? 


=o f” re ý sing do dp 
i 0 /p2—2Rpcos¢ +R? 


The inner integral over œ may be evaluated using the substitution u = —2Rpcos¢. 
We get 
1 Pre 2 2\-1/2 2 2 2)1/2 “ 
a (o +u +R?) du = ——(p*+u+ R*) 
2Rp i= 2Rp —2Rp 
1 
= Rp lle? + 2Re + RY — (0% — 2Rp + R?)”?] 
1 
= Rp lle + R)? — [o — RPP? } 
1 
= Reet —|o— RI). 


The expression p + R is always positive, but » — R may not be, so we must keep the 
absolute value sign. Substituting into the formula for V , we get 


2 


P2? p ax p2 
-V (0,0, R) = 2x | P (p+ R—[p—Rildo= = f plo +R -lp — R|) do 
a Re R Ja 


We consider two possibilities for R, corresponding to the gravitational potential for 
objects outside and inside the hollow ball W. 


CasæL If R > p [thatis, if (x1, Yı, Z1) is outside W J, then |o — R| = R — p for all p 
in the interval [ 01, p2], so that 


2 p2 4 p2 4 
-V00 R == [pip +R-(R—plldo= = f p?dp = 7" (63 - o). 
pl pl 


The factor (42/3) (3 — p?) equals the volume of W . Putting back the constants G, m, 
and the mass density, we find that the gravitational potential is -GmM /R, where M is 
the mass of W . Thus, V is just as it would be if all the mass of W were concentrated at 
the central point. 
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Case2. If R < p [thatis, if (x1, yi, 21) is inside the hole], then |o — R| = p — R for 
p in[ pi, 2], and so 


2 p2 p2 
-V (0,0, R) = (Gm Z f plo +R — (p — R)ldp = (Gm)4x f pdp 


Pı PL 


= (Gm)2x (of — p?). 


The result is independent of R, and so the potential V is constant inside the hole. 
B ecause the gravitational force is minus the gradient of V , we conclude that there is no 
gravitational force inside a uniform hollow planet! 


We leave it to you to compute V (0, 0, R) for the case pı < R < pp. 

A similar argument shows that the gravitational potential outside any spherically 
symmetric body of mass M (even if the density is variable) is V = G M m/R, where R 
is the distance to its center (which is its center of mass). 


example 8 | Find the gravitational potential acting on a unit mass of a spherical star with a mass 
M = 3.02 x 10% kg at a distance of 2.25 x 10!! m from its center (G = 6.67 x 
1071 N - m2/kg?). 


solution | Thenegative potential is 


GM 6.67 x 107" x 3.02 x 102 PIPE 
Ysg = EEEN =8.95 x 108 mst. Aa 


exercises 


L Find the coordinates of the center of mass of an 8(X, y) = y? sin24x + 2 (grams per square centimeter). 


isosceles triangle of uniform density bounded by the 
x axis, y = ax, and y = —ax + 2a. 


2 Assuming uniform density, find the coordinates of the 


center of mass of the semicircle y = vr? — x2, with 
y>0. 


Find the average of f(x, y) = ysinxy over D = 
[0, x] x [0, x]. 


Find the average of f(x, y) = e**Y over the triangle 
with vertices (0, 0), (0, 1), and (1, 0). 


Find the center of mass of the region between y = x2 
and y = x if the density is x + y. 


Find the center of mass of the region between y = 0 and 
y =x’, where0 <x < 4. 


7. A sculptured gold plate D is defined by 0 < x < 27 and 


0 < y < x (centimeters) and has mass density 


If gold sells for $7 per gram, how much is the gold in the 
plate worth? 


In Exercise 7, what is the average mass density in grams 
per square centimeter? 


(a) Find the mass of the box [0, 1] x [0, 1] x [0, 2], 
assuming the density to be uniform. 


(b) Same as part (a), but with a mass density 
8(X, y, Z) = X? + 3y? +z +1. 


Find the mass of the solid bounded by the cylinder 
x2 + y? = 2x and the cone z? = x? + y? if the density 


isd = \/x2 + y2. 


Find the mass of the solid ball of radius 5 with density 
given by 


8(X, y, Z) = 2x? + 2y? + 2z? 41 


assuming the center of the ball is at the origin. 
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12. A solid disk of radius 9 and height 2 is placed at the 
origin, so that it can be expressed by x2 + y? = 81 and 
0 <z <2. If the disk has a density given by 


8(X, y, Z) = 2x2 + 2y? + 272 +1, 
find its mass. 


13. Find the center of mass of the region bounded by 2. 


X+y+z=2,x =0, y = 0, and z = 0, assuming the 
density to be uniform. 


14. Find the center of mass of the cylinder 
x2 +y? < 1,1 <z < 2 if the density is 
8 = (x? + y2)z?, 


15. Find the average value of sin? zz cos? xx over the cube 
[0, 2] x [0, 4] x [0, 6]. 


16. Find the average value of e77 over the ball 
X +y? +z <L 


17. A solid with constant density is bounded above by the 
plane z = a and below by the cone described in 
spherical coordinates by @ = k, where k is a constant 
0 < k < 2/2. Set up an integral for its moment of 


inertia about the z axis. 23. 


18. Find the moment of inertia around the y axis for the ball 
x2 + y? + 22 < R2 if the mass density is a constant 8. 


19. Find the gravitational potential on a mass m of a 
spherical planet with mass M = 3 x 1026 kg, ata 
distance of 2 x 108 m from its center. 


20. Find the gravitational force exerted on a 70-kg object at 
the position in Exercise 19. 


2L A body W in xyz coordinates is called symmetric with 


respect to a given plane if for every particle on one side 24. 


of the plane there is a particle of equal mass located at 
its mirror image through the plane. 


(a) Discuss the planes of symmetry for an automobile 
shell. 


(b) Letthe plane of symmetry be the xy plane, and 
denote by W + and W ~ the portions of W above and 


below the plane, respectively. By our assumption, 25. 


the mass density 5(Xx, y, z) satisfies 5(x, y, —z) = 
5(X, y, Z). Justify the following steps: 


z- Jf s0e,y,2) dedyde = || 25(X, y, Z) dx dy dz 
w w 


(c) Explain why part (b) proves that if a body is 
symmetrical with respect to a plane, then its center 
of mass lies in that plane. 


(d) Derive this law of mechanics: If a body is symmetric 
with respect to two planes, then its center of mass 
lies on their line of intersection. 


A uniform rectangular steel plate of sides a and b rotates 
about its center of mass with constant angular velocity w. 


(a) The kinetic energy equals 3(mass)(velocity)?. 
Argue that the kinetic energy of any element of 
mass 5 dx dy(é = constant) is given by 
3(w2/2)(x2 + y?) dx dy, provided the origin (0, 0) is 
placed at the center of mass of the plate. 


Justify the formula for kinetic energy: 


we 
Kes ff 8— (x? + y?) dxdy. 
plate 2 


Evaluate the integral, assuming that the plate is 
described by the inequalities —a/2 < x < a/2, 
—b/2 < y < b/2. 


(b 


(c 


Asis well known, the density of a typical planet is not 
constant throughout the planet. A ssume that planet 
C.M .W. has a radius of 5 x 108 cm and a mass density 
(in grams per cubic centimeter) 


3x 104 
p(x, Y,Z) = r 
3, r < 104cm, 


wherer = \/x2 + y? + z2. Find a formula for the 


gravitational potential outside C.M .W. 


, r >10fcm, 


Let D bea region in the part of the xy plane with x > 0. 
Assume D has uniform density, area A(D ), and center of 
mass (X, y). Let W be the solid obtained by rotating D 

about the y axis. Show that the volume of W is given by 


vol(W) = 27xA(D). 
Use the previous exercise to show that if a doughnut is 


obtained by rotating the circle (x — a)? + y? =r? about 
the y axis, then the volume of the doughnut is 2zrar2. 


= ITI 25(X, y.zyaxdyar+ || 26(X, y, z) dx dydz 

w+ w- 
=f zax y,2) dxdyda + ff / —w(u, v, -w)dududw 
af w+ W+ 
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6.4 Improper Integrals (Optional) 


YA y= Q(x) 


y = o4(X) 
a b 
figure 6.4.1 A y-simple domain. 


> X 


In this section, we study improper integrals— that is, integrals in which the function 
may be unbounded or the region of integration is unbounded. We shall first recall the 
situation for functions of one variable. 


One-Variable Improper Integrals 


In the study of integrals of functions of one variable, we encounter various types of 
“improper” integrals; that is, integrals of unbounded functions defined on intervals or 
integrals of functions over unbounded intervals. For example, 


=q ei dx 
—dadx and I _ 
0 JK i x2 


are improper integrals. They are evaluated using a limiting process; for instance, 


1 
1 ' 1 
[ zem] qe = lim (2 |,) = Lim (2 — 2va) =2 
and 
a odx o, 1 
1 = lim m f Z= lim —-— 
1 X2? boo fy X? b>% X 


If, in such a limiting process, the limit does not exist (or is infinite), we say that the 
integral does not exist (or that the integral diverges). 


Improper Integrals in the Plane 


Next, we describe three types of improper integrals of two variables over a region D. 
The first two types are described in the text below, and the third type (integrals over 
unbounded regions) is left to the exercises. We will evaluate all integrals using a limiting 
process, as in the one-variable case. 

For simplicity of exposition, we first restrict ourselves to nonnegative functions f — 
thatis, f(x, y) > Oforall points (x, y) < D — and to y-simple regions described as the 
set of (x, y) such that 


a<x<b, = gi(x) <y < d(x), 


as in Figure 6.4.1. 
Inthe first case we wish to consider, let’s assume that f : D — Ris continuous except 
for points on the boundary of D . Consider, for example, 


f(x, y) = Ji- xy? 


where D is the unit disk D = {(x, y)|x* + y? < 1}. Clearly, f is not defined on the 
rater of D,wherex?+y? = 1; yetitwill be of practical interest to be able to evaluate 
JJ f(x, y) dA, because this integral represents the area of the upper hemisphere of the 
unit sphere in 3-space. 
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example | 


y y = G(X) 


figure 6.4.2 A shrunken domain D,,s for 
improper integrals. 


Exhausting Regions 


Our basic idea will be to integrate such an f over a smaller region D’, where we know 
the integral exists, and then let D’ “tend” to D ; that is, “exhaust” D and seeif ffy f dA 
tends to some limit. With this in mind, we pick a special kind of D’, as follows. 

Let 7 > 0 be small enough so that a +n < b - n. Let > 0 be small enough so that 
oi(X) + 5 < (x) -ô for all x,a < x < b (See Figure 6.4.2). If g2(x) = @i(x) for 
some x, no such 6 will exist, but we shall worry about this minor issue when it arises in 
our later examples. Then the region 


D pa = {(x, yla +n <x<b-n and gi(x)+5<y < (x) — ô} 


is a subset of D , and as (7, 5) > 0, D „s tends to D. 


Improper Integrals as Limits 


Because f is continuous and bounded on D,,;, the integral ff D „sf dA exists. We can 
now ask what happens as the region D „,, expands to fill the region, D — that is, as 


(n, 5) — (0, 0). Provided that 
lim J | f dA 
(na> J Jp, , 


exists, we say that the integral of f over D is convergent or that f is integrable over 
D, and we define ffa fdx dy to be equal to this limit. 


Evaluate 


Il zz” 


where D is the unit square [0, 1] x [0, 1]. 


solution 


X 


figure 6.4.3 The slightly shrunken 
unit square. 
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D is clearly a y-simple region. Choose 7 > 0 and ô > 0 so that D, c D, asin 
Figure 6.4.3. Then, by Fubini’s theorem: 


1 l-n 1-6 1 
I | 4h = f vag 
Dye KY n ô KY 


l-n 1 1-65 1 
af sax f Ža 
n JX ô VSV y 


= : (u n)2/3 ne) . : (u 6)2/3 2e), 


Letting (n, 5) — (0, 0), we see that 


im, fS te= 
(n,8)—>(0,0) na” 22 Ẹ A 


Unfortunately, it may not always be possible to evaluate such limits so directly and 
simply. This is often the case in the most interesting examples, as with the surface area of 
the hemisphere, mentioned earlier. It’s as if the “real world” always presents the greatest 
challenges to the mathematician! So let us expand a bit on our theoretical discussion. 


Improper Integrals as Limits of Iterated Integrals 


Suppose f is integrable over D,,5. We can then apply Fubini’s theorem to obtain 


ġ2(x)— 
/ n f dA = ie (x, y) dy dx. 
Die a+n o1(X 


Hence, if f is integrable over D, 


b-n 
fdA= lim Jf. E Gave (1) 
JI, (7,6)>(0,0) Ja pı(x)+8 y) J 


Now F ( = ffo, f dA is a function of two variables, n and 5, because as we 
change 7 a A we get another number. Now if f is integrable, then 


lim F(n,8)= L 
all , (n, 6) 


exists. It follows that the iterated limits 


lim lim F (n, ô) and lim lim F (n, 5) 


7705-0 5607-0 


also exist and are both equal to L, which in our caseis f fẹ fdA. Thus, the iterated limit 


b-n 
inim f E (x, y) dy dx 
n—>0ô—>0 or(x)+ 


also exists. Conversely, if the iterated limits exist, it does not generally follow that the 
limit liM(,,5)-+(0,0) F (77, 6) exists. 
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For example, if it were to turn out in some way that F (7, 5) = /(n* + 62), then 
limo liMso F (n, 6) = limsolim,—+o F (7,6) = 0; yet lim(,,s)+0 F (n, 6) does not 
exist, because F (n, n) = 1/2 (see Section 2.2). 

In view of this, consider expression (1) again. If f is integrable, then 


b-n 2(x)—6 
[fe f(x,y) d A= tims [ f, f(x, y) dy dx 


b-n poz(x)—ô 
= lim lim m | | f(x, y) dy dx. 
1 (0) 89 Jorn Sor(x)+8 


2(x)—5 
lim | F(x, y) dy 
(x) +5 


© 


Now suppose that for each x, 


exists. Denote this by o f (x, y) dy. Suppose further that 


h(x) 
im f J f(x, y) dy 
19 Jatn Joi) 


also exists. We denote this limit by J? PoR f(x, y) dy dx. Then if all limits exist, all 


limits must be equal. Thus, if f is integrable and the iterated improper integral exists, 


then necessarily 
f| foun aa = [lf f(x, y) dy dx. 
orl 


However, is it possible that the existence of just the iterated integrals implies the inte- 
grability of f ? We turn to this important question next. 


Fubini’s Theorem for Improper Integrals 


For integrals, something truly remarkable happens. U nlike the case for iterated limits (as 
in the counterexample considered earlier), the existence of the iterated limits does imply 
the integrability of f as long as f > 0. Thus, if f > 0 and if f° Ion (x, y) dy dx 
exists as an iterated limit, then f is integrable and 


b p(x) 
J) txya=/ f f(x, y) dy dx. 
D a J (x) 


If D is an x-simple region with the x coordinate lying between two functions y and 


Wo, and if 
yl y) 
[ fi (x, y) dxdy 
yal 


exists as an improper integral, it again follows that f is integrable and 


f| fo aa = Pp f(x, y) dxdy. 
wily) 


example 2 


solution 


example 3 


solution 
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All these results, which are the improper analogues of Theorems 4 and 4’ in Section 5.3, 
are known as F ubini’s theorem for improper integrals, which we formally state. 


Theorem 3 Fubini’s Theorem Let D bean elementary region in the plane 
and f > 0 a function continuous except for points possibly on the boundary of D . 
If either of the integrals 


/I f(x, y) dA, 


[ fe (x, y) dy dx, for y-simple regions 
g1(x) 


yaly) 
l T f(x, y) dx dy for x-simple regions 
c Jyly) 


exist as improper integrals, f is integrable and they are all equal. 


The proof of this involves advanced concepts of analysis, so we omit it here. This 
result can be quite useful in calculation, as the next example shows. 


Let f(x, y) =1/\/1— x? — y?. Show that f is integrable and that ffa f(x, y) dA = 
27, half the surface area of the unit sphere. 


For —1 < x < 1, we have 


dy j a/1+x2—ô dy 
-yi2 Vf/1— x? y? 0J xs y1- xX? y? 
a/1—x?—8 
= limsin~ ( y ) | 
5-40 JI =x? —/1-x245 
ô ô 
= lim 4 sin ( 1 — —___ ]=sin" (-1 )} 
= sin™}(1) — sin N=5 z =m 
Clearly, 
l-n 1- fix? dy dx l-n 
li =li =li = = 2x. 
imf S s- a ee lim bis dx lim x(2 2n) = 27 


Thus, f is integrable. To see why this theorem is so useful, try to show directly from 
the definition that f is integrable. Itis not easy to do so! A 


Let f(x, y) = 1/(x — y) and let D be the set of (x, y) satisfying 0 < x < 1 and 
0 < y < x. Show that f is not integrable over D. 


B ecause the denominator of f is zero on the line y = x, f is unbounded on part of the 
boundary of D. Let 0 < n < 1and0 <6 < n, and let D „s be the set of (x, y) with 
n<X <l—nandé < y <x -— ô (Figure 6.4.4). 
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figure 6.4.4 The shrunken 
domain D,,5 for a triangular 
domain D. 


Here the region D is y-simple with @1(x) =0, ¢2(x) = x, and (0) =@2(0). To ensure 
that D, c D and is depicted in the figure, we must choose ô a bit more carefully. A 
little geometry shows that we should choose 28 < n. Consider 


l-n x—ô 1 
1 fda = f f dy dx 
Dy, n ô X —y 


l-n 
=f [—log(x — y)] 


l-n 
z / [—log (8) + log (x — 8) dx 


l-n l-n 
= |- 1o93] f ax+ f log (x — ô) dx 
n 


n 


x—ô 
yas dX 


= —(1 — 2n) logs + [(x — 8) log (x — 8) — (x — 8)”. 
In the last step, we used the fact that f logu du = u logu — u. Continuing the preceding 
set of qualities, we have 
JJ f dA = —(1 — 2n) logs + (1 — n — ô) log(1 — n — 8) 
Dys 
1(1— n — ô) — (n — 8) log (n — 6) + (n — ô). 


As (n, 5) — (0, 0), the second term converges to 1 log 1 = 0, and the third and fifth 
terms converge to —1 and 0, respectively. Let v = 7 — ê. Because v log v > Oas v —> 0 
(alimit established by using L'H ôpital’s rule from calculus?), we see that the fourth term 
goes to zero as (7, 5) — (0, 0). It is the first term that will give us trouble. Now: 


—(1—2n) logs = — log 8 + 2n log ô, (2) 


and it is not hard to see that this does not converge as (7, 5) —> (0, 0). For example, 
let 7 = 28; then expression (2) becomes — log 8 + 4élog 5. As before, 45log5 —> 0 
as 5 — 0, but —logs — +00 as 5 — 0, which shows that expression (2) does not 
converge. Hence, lim,,,s) (0,0) So, f dA does notexistandso f isnotintegrable. A 


Functions Unbounded at Isolated Points 


We now consider nonnegative functions f that become “infinite” or are undefined at 
isolated points in an x-simple or y-simple region D . For example, consider the function 
f(x, y) = 1/,/x2-4+ y? on the unit disk D = {(x, y)|x* + y? < 1}. Again, f > 0, but 
f is unbounded and is not defined at the origin. 

Let (Xo, Yo) be a point of a general region D where a nonnegative function f is 
undefined. Further, let Ds = D4 (Xo, Yo) be the disk of radius 5 centered at (Xo, Yo) 
and let D \D denote the region D with D, removed. Assume that f is continuous at 
every point of D except (Xo, yo). Then f fop, f dA is defined. We say that ffa f dA is 
convergent, or that f is integrable over D If 


im [ f dA 
60 D\Ds 


exists. 


3L' Hôpital’s rule was discovered by Bernoulli and was reported in L'H ôpital’s textbook. 


6.4 Improper Integrals (Optional) 345 


example 4 | Show that f(x,y) =1/,/x2 + y? is integrable over the unit disk D and evaluate 
Jp fdA. 


solution Let D, bethedisk of radius 5 centered at the origin. Then f is continuous everywhere on 
D exceptat(0, 0). Thus, ffon, f dA exists. To evaluate this integral, wechange variables 
to polar coordinates, x =r cos, y =r sing. Then f(r cosdé,r sing) =1/r, and 


1 2x 1 1 2x 
J] faa = | f =f de dr = f de dr =2n(1—2). 
D\Ds s Jo T 5 JO 
J| fan = tim f) a å 
D 60 D\Ds 


M ore generally, we can, in an analogous manner, define the integral of nonnegative 
functions f that are continuous except at a finite number of points in D. We can also 
combine both types of improper integrals; that is, we may consider functions that are 
continuous except at a finite number of points on D or at points on the boundary of D, 
and define ffa f dA appropriately. 

If f takes both positive and negative values, we can use a more advanced integration 
theory, called the Lebesgue integral, to generalize the notion of convergent integral ffo 
f dA. Using this theory, it is possible to show that if ff, f dA exists, it can then be 
evaluated as an iterated integral. This latter fact is also known as Fubini’s theorem. 


Thus, 


Unbounded Regions 


As was mentioned previously, we will leave consideration of unbounded regions to the 
exercise section. However, we must point out that we have already addressed the main 
idea in Example 5 of Section 6.2 on the Gaussian integral. In that example, we integrated 
exp(—x* — y?) over all of R? by integrating first over a disk of radius a and then letting 
aro. 


exercises 


In Exercises 1 to 4, evaluate the following integrals if they exist (discuss how you define the integral if it was not given in the 
text). 


1 5. Let D =[0, 1] x [0, 1]. Let0 <a <1land0 <6 <1. 
L I» dA, where D = [0, 1] x [0, 1] Evaluiàte: 
dxdy 

2 ——l — dxdy, where D = {(x, 0<x<1, : 

Ika i i JI xey? 

0<y<1,y <x} 

6. Let D = [1, œ) x [1, œ]. Let 1 < y and 1 < p. 
3. J [iym aeay, wrer D is bounded by x = 1, x = y, Evaluate: 
D 


and x = 2y 


1 pe 
4 I | log x dx dy 
0 Jo 


dx dy 
D xY ye ` 
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7. (a) Evaluate 


dA 
p (x2 + y2)2/3 


where D is the unit disk in R2. 


(b) Determine the real numbers à for which the integral 
1 dA 
D (x2 + y?) 
is convergent, where again D is the unit disk. 
8. (a) Discuss how you would define ffa f dA if D isan 


unbounded region— for example, the set of (x, y) 
such thata < x < oo and ¢1(x) < y < ¢2(x), 
where ¢1 < #2 are given (Figure 6.4.5). 
Evaluate ffy xye(* +9") dx dy if x > 0, 
O<y<l. 


~ 
o 


y = ġ1(x) 


figure 6.4.5 An unbounded region D. 


9. Using Exercise 8, integrate e-*Y for x >0,1<y <2 
in two ways. Assuming Fubini’s theorem can be used, 


show that 
CO a—X _ a—2X 
| ee ax =log2. 
0 X 


10. Show that the integral 


1 pa 
p f (x/,/a2 — y?) dy dx 
o Jo 


exists, and compute its value. 


11. Discuss whether the integral 


x+y 
II Pane 


exists where D = [0, 1] x [0, 1]. If it exists, compute its 
value. 


12. We can also consider improper integrals of functions 
that fail to be continuous on entire curves lying in some 


region D. For example, by breaking D = [0, 1] x [0, 1] 
into two regions, define and then discuss the 
convergence of the integral 


Ih aa 


13. Let W be the first octant of the ball x2 + y? +22 <a’, 
where x > 0, y > 0, z > 0. Evaluate the improper 


integral 
24 y2 4 92)1/4 
JII (x+ yé + 2°) dx dy dz 
w /Z +(x? +y? +22) 


by changing variables. 


14. Let f beanonnegative function that may be unbounded 
and discontinuous on the boundary of an elementary 
region D. Let g be a similar function such that 
f(x, y) < g(x, y) whenever both are defined. Suppose 
Sfo g(x, y) dA exists. Argue informally that this 
implies the existence of Us f(x, y) dA. 


15. Use Exercise 14 to show that 


2 
[= sin(x — = 
— 
exists where D is the unit disk x? + y2 <1. 


16. Let f beas in Exercise 14 and let g be a function such 
that 0 < g(x, y) < f (x, y) whenever both are defined. 
Suppose that f fy g(x, y) dA does not exist. Argue 


informally that JJ f(x, y) dA cannot exist. 


T7. Use Exercise 16 to show that 


x?+y? 
Jf e dy dx 
p XY 
does not exist, where D is the set of (x, y) with 
0<x<land0<y <x. 


18. Let D be the unbounded region defined as the set of 
(x, y, z) with x? + y? +z? > 1. By making a change of 
variables, evaluate the improper integral 


dx dy dz 
p (X2 + y? ee? 
19. Evaluate 
1 pyy 1 fiy 
—dx dy and f | o 
Lf y o Jx Y 


Does Fubini’s theorem apply? 


20. In Exercise 17 of Section 5.2 we showed that 


lpl 2 2 ll y2 2 
x*-y xt- y 
—— 7 dyd a dx dy. 
ih 24y” f pepy 
Thus, Fubini’s theorem does not hold here, even though 


the iterated improper integrals both exist. What went 
wrong? 


2L If 0 < f(x,y) < g(x, y) forall (x, y) € D, and the 
improper integral of g 


ffon 
D 
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exists, then ffa f(x, y) dx dy also exists. U se this fact 
and exercises 5 and 6 to argue that if 0 < œ, 8 < land 


1 < y, p, then 
dx dy 
xayf 4 xvye 
D 


exists, where D = [0, 00) x [0, 00). 
[HINT: Write D = D1 U D2 and apply Exercise 14 to 
each Dj; separately.] 


L (a) Find a linear transformation taking the square 
S = [0, 1] x [0, 1] to the parallelogram P with 
vertices (0, 0), (2, 0), (1, 2), (3, 2). 


(b) Write down a change of variables formula 
appropriate to the transformation you found in 
part (a). 
2. (a) Find the image of the square [0, 1] x [0, 1] under the 
transformation T (x, y) = (2x, x + 3y). 
(b) Write down a change of variables formula 


appropriate to the transformation and the region you 
found in part (a). 


3. Let B be the region in tae trat quad ant pete By the 
curves xy = 1, xy = 3, x? — y? = 1, and x? — y? = 4. 
Evaluate ffy (x? + y; 2) dx dy using the change of 
variables u = x? — y?, v = xy. 


4. In parts (a) to (d), make the indicated change of 
variables. (Do not evaluata) 


OL S rn 


p Sor 


J (4—x2-y?) 
(b) a ive T xyzdzdxdy, 
J(4-x2-y?) 


iil cooing 


J (4—x2-y?) 
(c) I. To ee z2dz dx dy, 


spherical ee 


a + y2)"/2dx dy dz, 


1 pr/4 pda 
(d) p> sin 2¢ d0 dọ dp, rectangular 


0 Jo 0 
coordinates 


5. Find the volume inside the surfaces x? + y? = z and 
x2 4 y? 4724 =2. 


6. Find the volume enclosed by the cone x2 + y? = z? and 
the plane 2z — y — 2 = 0. 


7. A cylindrical hole of diameter 1 is bored through a 
sphere of radius 2. A ssuming that the axis of the cylinder 
passes through the center of the sphere, find the volume 
of the solid that remains. 


8. Let C1 and C2 be two cylinders of infinite extent, of 
diameter 2, and with axes on the x and y axes, 
respectively. Find the volume of their intersection, 
Cina: 


9. Find the volume bounded by x/a + y/b +z/c = 1 and 
the coordinate planes. 


10. Find the volume determined by z < 6 — x? — y? and 
Z> yX? +y? 


IL The tetrahedron defined by 
x>0,y>0,z>0,x +y+z < 1is to be sliced into n 
segments of equal volume by planes parallel to the plane 
X + y +Z = 1. Where should the slices be made? 


12. Let E bethe solid ellipsoid E = {(x, y, z) | (x2/a2)+ 
( y2/b2) + (z2/c2) < 1}, wherea > 0,b > 0, 
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and c > 0. Evaluate 


J [[vveccayee 


(a) over the whole ellipsoid; and 
(b) over that part of it in the first quadrant: 


y2 z2 


x>0, y>0, and z>0, tatas 


Find the volume of Tie ice cream cone” defined by the 
inequalities x2 + y? < z2, and 0 < z < 


54+ 4/5—x2— y2. 


Let p, 6, @ be spherical coordinates in R? and suppose 
that a surface surrounding the origin is described by a 
continuous positive function o = f(6, ¢). Show that the 
volume enclosed by the surface is 


2r m 
V =; f | [f(6, 6) FP singddde. 
3 Jo Jo 


Using an appropriate change of variables, evaluate 


I exp[(y —x)/(y + x)] dxdy, 
B 


where B is the interior of the triangle with vertices at 
(0, 0), (0, 1), and (1, 0). 


Suppose the density of a solid of radius R is given by 
(1 +d3)—1, where d is the distance to the center of the 
sphere. Find the total mass of the sphere. 


17. The density of the material of a spherical shell whose 


inner radius is 1 m and whose outer radius is 2 mis 
0.4d? g/cm?, where d is the distance to the center of the 
sphere in meters. Find the total mass of the shell. 


If the shell in Exercise 17 were dropped into a large tank 
of pure water, would it float? What if the shell leaked? 
(Assume that the density of water is exactly 1 g/cm3,) 


19. The temperature at points in the cube C = {(x, y, z) | — 


1<x <1,-l<y < 1,and —1 < z < 1} is 32d?, 
where d is the distance to the origin. 


(a) What is the average temperature? 


(b) At what points of the cube is the temperature equal 
to the average temperature? 


Use cylindrical coordinates to find the center of mass of 
the region defined by 


(x-1)? +y? 427 <1, X>1. 


<1. 


ZL Find the center of mass of the solid hemisphere 


V = {(x, y, Z) |x? + y? +z? < a? and z > 0}, 
if the density is constant. 


Evaluate ff} e oo dx dy, where B consists of those 
(x, y) satisfying x? + y? < 1 and y <0. 


dx dy dz 
s (XZ + y? zee 


where S is the solid bounded by the spheres 
x2 + y? +z? = a? and x2 + y? + z? = b?, where 
a>b>0. 


Evaluate 


Evaluate (x? + y? + z?) xyz dx dy dz over each of 
D 
the following regions. 


(a) The sphere D = {(x, y, z) | X2? + y? +z? < R2} 
(b) The hemisphere D = {(x, y, z) | x2-+ 
y? +z? < R? andz > 0} 
(c) Theoctant D = {(x, y,z) |x >0,y>0,z>0, 
and z? + y? +z? < R?} 


LetC be the cone-shaped region 
{(x, y, 2) | \/x2 + y? < z < 1} in R? and evaluate the 


integral JS | 0+ Vay 
c 


Find III. f (x, y, z) dx dy dz, where 
F(x, y, z) = exp [—(x? + y? + z?)3/?]. 


27. The flexural rigidity El of a uniform beam is the product 


of its Young's modulus of elasticity E and the moment of 
inertia | of the cross section of the beam with respect to 
a horizontal line! passing through the center of gravity 
of this cross section. Here 


| =j [d(x, y)I2dx dy, 
R 


where d(x, y) = the distance from (x, y) tol and R = 
the cross section of the beam being considered. 


(a) Assume that the cross section R is the rectangle 
—1 <x <1,-l<y < 2, and | is the line y = 1/2. 
Find |. 

(b) Assume the cross section R is a circle of radius 4 
and | is the x axis. Find |, using polar coordinates. 


28. Find, ff fgs f(x, y, z) dx dy dz, where 


1 


f(x,y,z) = [1 4 (x2 + y2 + z2)372]372 ` 


29. Suppose D is the unbounded region of IR? given by the 
set of (x, y) withO < x < œ,0 < y < x. Let 
f(x, y) =x73/2eY-*, Does the improper integral 
Jo f (x, y) dx dy exist? 


30. If the world were two-dimensional, the laws of physics 
would predict that the gravitational potential of a mass 
point is proportional to the logarithm of the distance 
from the point. Using polar coordinates, write an integral 
giving the gravitational potential of a disk of constant 
density. 


3L (a) Evaluate the improper integral 


oo py R 
| f xe™” dxdy. 
o Jo 
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(b) Evaluate 


J [0 + 2x2y? + y4) dxdy, 
B 


where B is the portion of the disk of radius 2 
[centered at (0, 0) in the first quadrant]. 


32. Let f be a nonnegative function on an x-simple or a 
y-simple region D c R? and that is continuous except 
for points on the boundary of D and at most finitely 
many points interior to D. Give a suitable definition of 


[Jy f dA. 


33. Evaluate J fae f(x, y) dx dy, where f(x, y) = 
1/(1 + x2 + y2)3/2. (HINT: You may assume that 
changing variables and Fubini’s theorem are valid for 
improper integrals.) 


this page left intentionally blank 


7 


Integrals Over Patns 
and surfaces 


| hold in fact: (1) That small portions of space are of a nature analogous fo little hills on a 
surface which is on the average flat. (2) That this property of being curved or distorted is 
continually passed on from one portion of space to another after the manner of a wave. 
(3) That this variation of curvature of space is really what happens in that phenomenon 
which we call the motion of matter whether ponderable or ethereal. (4) That in this physical 
world nothing else takes place but this variation, subject, possibly, to the law of continuity. 


—W. K. Clifford (1870) 


Everyone who is seriously involved in the pursuit of science becomes convinced that a spirit 
is manifest in the laws of the universe, one that is vastly superior to that of man. 


—Albert Einstein 


In Chapter 5 we studied integration over regions in R? and Rê. In this 
chapter we study integration over paths and surfaces. This is basic to an 
understanding of Chapter 8, in which we discuss the basic relation be- 
tween vector differential calculus (Chapter 4) and vector integral calcu- 
lus (this chapter), a relation that generalizes the fundamental theorem 
of calculus to several variables. This generalization is summarized in the 


theorems of Green, Gauss, and Stokes. 


7.1 The Path Integral 


This section introduces the concept of a path integral; this is one of the several ways in 
which integrals of functions of one variable can be generalized to functions of several 
variables. B esides those in Chapter 5, there are other generalizations, to be discussed in 
later sections. 

Suppose we are given a scalar function f: R? —> R, so that f sends points in R? to 
real numbers. It will be useful to define the integral of such a function f along a path 
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example | 


solution 


c:| = [a,b] — R?, where c(t) = (x(t), y(t), z(t)). To relate this notion to something 
tangible, suppose that the image of c represents a wire. We can let f(x, y, z) denote 
the mass density at (x, y, z) and the integral of f will be the total mass of the wire. 
By letting f(x, y, z) indicate temperature, we can also use the integral to determine 
the average temperature along the wire. We first give the formal definition of the path 
integral and then, after the following example, further motivate it. 


Definition Path Integrals The path integral, or the integral of f(x, y, z) 
along the path c, is defined when c: | = [a, b] — R? is of class C+ and when 
the composite function t > f(x(t), y(t), z(t)) is continuous on |. We define this 
integral by the equation 


b 
[r= | F (x(t), y(t), 2(t)) Iet) dt. 


Sometimes f, f ds is denoted 


[tyas 


b 
| f (e(t) ) e(t) Il dt. 


or 


If c(t) is only piecewise C? or f(¢(t)) is piecewise continuous, we define 
J, f ds by breaking [a, b] into pieces over which f(¢(t))|\Ie(t)|| is continuous, 
and summing the integrals over the pieces. 


When f = 1, we recover the definition of the arc length of c. Also note that f need 
only be defined on the image curve C of c and not necessarily on the whole space in 
order for the preceding definition to make sense. 


Let c be the helix c: [0, 27] — R?,t + (cost, sint, t) (see Figure 2.4.9), and let 
f(x, yY, Z) =x? + y? + 2”. Evaluate the integral f. f(x, y, z) ds. 


First we compute ||¢’(t) ||: 


2 . 2 2 
let) = (ee) + F + H = v sin?t + cos? t +1 = v2. 


Next, we substitute for x, y, and z in terms of t to obtain 
f(X, y,z) = X? + y? +z? = cost + sin? t +t? = 14t? 


along c. Inserting this information into the definition of the path integral yields 


2x 3727 
[tocy2as= f (+e)v2dt= v2 [t+ 5| = EMOTE a gay, 
c 0 


3 0 3 A 


figure 7.1.1 Breaking ¢ into 
smaller ¢;. 
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C(ty) 


c (ti 1) 


e—e—e—e—0—0— 00000 
a=ħb tt ti ty) ty =b 


To motivate the definition of the path integral, we shall consider “Riemann-like” sums 
Sy in the same general way we did to define arc length in Section 4.2. For simplicity, 
let c be of class C+ on I. Subdivide the interval | = [a, b] by means of a partition 


a=t<t <---<ty =D. 


This leads to a decomposition of c into paths c; (Figure 7.1.1) defined on [t;, ti+1] for 
0 <i <N — 1. Denote the arc length of c; by As;; thus, 


tiga 
as = f ye(tiet 
ti 


When N is large, the arc length As; is small and f(x, y, z) is approximately constant 
for points on ¢,. We consider the sums 


N-1 
Sn DS f (Xi, yi, Zi) ASi, 
i=0 


where (Xi, yi, zi) = c(t) for somet e [ti, ti+1]. By the mean-value theorem we know 
that As; = |jc’(t*) || Ati, where ti < t* < t41 and At; = t)4; — ti. From the theory of 
Riemann sums, it can be shown that 


N-1 


limit Sy = limit >, f (Xi Yi, Zi) le (t#) At = f f (x(t), y(t), 2(t)) Je(t) || dt 


= tix y,204s 


The Path Integral for Planar Curves 


An important special case of the path integral occurs when the path c describes a plane 
curve. Suppose that all points c(t) lie in the xy plane and f is a real-valued function of 
two variables. The path integral of f along cis 


b 
[toonds= f EKD, yon VEE at 
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example 2 


solution 


F(x), yt) 


e= (C), yE) figure 7.1.2 The path integral as the area of 
a fence. 


When f(x, y) = 0, this integral has a geometric interpretation as the “area of a fence.” 
We can construct a “fence” with base the image of c and with height f(x, y) at (x, y) 
(Figure 7.1.2). If c moves only once along the image of c, the integral f. f(x, y) ds 
represents the area of aside of this fence. Readers should try to justify this interpretation 
for themselves, using an argument like the one used to justify the arc-length formula. 


Tom Sawyer's aunt has asked him to whitewash both sides of the old fence shown in 
Figure 7.1.3. Tom estimates that for each 25 ft? of whitewashing he lets someone do for 
him, the willing victim will pay 5 cents. How much can Tom hope to earn, assuming 
his aunt will provide whitewash free of charge? 


f(x, y)=1 +f 


c: t — (30 cos? z, 30 sin? t) 


figure 7.1.3 Tom Sawyer's fence. 


x 


From Figure 7.1.3, the base of the fence in the first quadrant is the path c: [0, 7/2] > 
R2,t + (30cos?t, 30sin?t), and the height of thefence at (x, y) is f(x, y) =1+y/3. 
The area of one side of the half of the fence is equal to the integral f. f(x, y)ds = 
J.(1 + y/3) ds. Because c'(t) = (—90 cos?t sint, 90 sin? t cost), we have |¢(t) || = 
90 sint cost. Thus, the integral is 


zf in3 
fart) as = | (E =) 0sint cost dt 
c 3 0 3 


x/2 
" 90 f (sint + 10 sint) cost dt 
0 


HA2 x/2 
= oof $E +2sin°t| =90(5 +2) = 225, 


0 
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which is the area in the first quadrant. Hence, the area of one side of the fence is 450 ft?. 
Because both sides are to be whitewashed, we must multiply by 2 to find the total area, 
which is 900 ft?. Dividing by 25 and then multiplying by 5, we find that Tom could 
realize as much as $1.80 for the job. A 


This concludes our study of integration of scalar functions over paths. In the next 
section we shall turn our attention to the integration of vector fields over paths, and we 
shall see many further applications of the path integral in Chapter 8, when we study 
vector analysis. 


Supplement to Section 7.1: The Total Curvature of a Curve 


Exercises 16, 17, and 20-23 of Section 4.2 described the notions of curvature « and tor- 
sion z of asmoothcurveC in space. If c: [a, b] > C CR?isaunit-speed parametrization 
of C ,so that ||¢’(t) || = 1, then the curvaturex«(p) at p € C isdefined by «(p) = |/e’(t) ||, 
where p = c(t). A result of differential geometry is that two unit-speed curves with 
the same curvature and torsion can be obtained from one another by a rigid rotation, 
translation, or reflection. 

The curvature «:C — R is a real-valued function on the set C, so we define the 
total curvature as its path integral over C : [. « ds. There are some surprising facts that 
mathematicians have been able to prove about the total curvature. For one thing, if C is 
a closed [that is, c(a) = ¢(b)] planar curve, then 


[ease 2x 
c 


and equals 27 only when C is a circle. If C is a closed space curve with 


[ads san 
č 


then C is“unknotted”; thatis, C can be continuously deformed (without ever intersecting 
itself ) into a planar circle. Therefore, for knotted curves, 


feds > 4r. 
c 


See Figure 7.1.4. 


figure 7.1.4 A knotted curve in R. 
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The formal statement of this fact is known as the Fary-Milnor theorem. Legend 
has it that John Milnor, a contemporary of John Nash's? at Princeton University, was 
asleep in a math class as the professor wrote three unsolved knot theory problems on the 
blackboard. A tthe end of the class, M ilnor (still an undergraduate) woke up and, thinking 
the blackboard problems were assigned as homework, quickly wrote them down. The 
following week he turned in the solution to all three problems— one of which was a proof 
of the Fary-M ilnor theorem! Some years later, he was appointed a professor at Princeton, 
and in 1962 he was awarded (albeit for other work) a Fields medal, mathematics’ highest 
honor, generally regarded as the mathematical N obel Prize. 


exercises 


In Exercises 1 to 4, find an appropriate parametrization for the given piecewise-smooth curve in R2, with the implied 
orientation. 


1. The curve C, which goes along the circle of radius 3, 
from the point (3, 0) to the point (—3, 0), and thenina 
straight line along the x-axis back to (3, 0) 


3. ThecurveC, which goes along y = sinx from the point 
(0, 0) to the point (x, 0), and then along the x-axis back 
to (0, 0) 


2. The curve C, which goes along y = x? from the 4. Theclosed curve C described by the ellipse 
point (0, 0) to the point (2, 4), then in a straight line (x —2)2 (y —3) 
from (2, 4) to (0, 4), and then along the y-axis back + = 
4 9 
to (0, 0) , 
oriented counterclockwise 


1 


In Exercises 5 to 8, find an appropriate parametrization for the given piecewise-smooth curve in R3. 


5. The intersection of the plane z = 3 with the elliptical 
cylinder 
x2 y 
om 
6. The triangle formed by traveling from the point (1, 2, 3) 
to (0, —2, 1), to (6, 4, 2), and back to (1, 2, 3) 


7. The intersection of the surfaces y = x and z = x3, from 
the point (—3, —3, 9) to (2, 2, 4) 


8. The intersection of the cylinder y? + z2 = 1 and the 


plane z = x 


9. Let f(x, y,z) = y and c(t) = (0,0,t), 0 <t < 1. 


Prove that k fds = 0. 


10. Evaluate the following path integrals Je f(x, y, z) ds, 


where 


(a) f(x,y,z) =x + y+ zand 
c: tr (sint, cost, t), t € [0, 27] 


(b) f(x,y,z) = cosz, cas in part (a) 


11. Evaluate the following path integrals k f(x, y, z) ds, 


where 
(a) f(x, y,z) = exp vZ, and 
c: t > (1, 2, t2), t e [0, 1] 


(b) f(x,y,z) = yz, and ¢& t + (t, 3t, 2t), 
t e [1,3] 


12. Evaluate the integral of f(x, y, z) along the path c, 


where 


(a) f(x, y, z) = x cosz, c:t > ti +tĉ?j, t € [0,1] 


(b) f(x,y,z) = (x + y)/(y +2), and 
c tre (t, $t3/?, t),t e [1,2] 


13. Let f: R3\{xz plane} > R be defined by 


f(x, y, z) = 1/y?. Evaluate i f(x, y, z) ds, where 
c:[1, e] > R? is given by e(t) = (logt)i + tj + 2k. 


1) ohn Nash is the subject of Sylvia Nasar’s best-selling biography, A Beautiful M ind, a fictionalized version of which was made into a movie 
in 2001. 
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15. Let f(x, y) = 2x — y, and consider the path 
x=t4yat4,-l<t<l. 


14, (a) Show that the path integral of f(x, y) along apath 
given in polar coordinates by r =r(@), 
0 < 0 < 6,15 


& dr\2 
f(r cos8,r sind) r+ (F) dé. 


A 


(a) Compute the integral of f along this path and 
interpret the answer geometrically. 


(b) Evaluate the arc-length function s(t) and redo 
part (a) in terms of s (you may wish to consult 


(b) Compute the arc length of the path Excise 2, Sechomta): 


r =1+ cos, 0 <90 < 2x. 


Exercises 16 to 19 are concerned with the application of the path integral to the problem of defining the average value of a 
scalar function along a path. D efine the number 


Je f(x, y, 2) ds 
I(c) 


to be the average value of f along c. H ere |(c) is the length of the path: 


I(¢) = fructose 
c 


(This is analogous to the average of a function over a region defined in Section 6.3.) 


16. 


17. 


18. 


19. 


20. 


(a) Justify the formula [ fe f(x, y, z) ds]/I (c) for the 
average value of f along c using Riemann sums. 

(b) Show that the average value of f along cin 
Example 1 is (1 + $22). 

(c) In Exercise 10(a) and (b) above, find the average 
value of f over the given curves. 


Find the average y coordinate of the points on the 
semicircle parametrized by c: [0, x] > R3, 
8 +> (0,asin6,acosé);a > 0. 


Suppose the semicircle in Exercise 17 is made of a wire 
with a uniform density of 2 grams per unit length. 


(a) Whatis the total mass of the wire? 


(b) Where is the center of mass of this configuration of 
wire? (Consult Section 6.3.) 


Let c be the path given by e(t) = (t2, t, 3) fort € [0, 1]. 


(a) Find l(c), the length of the path. 
(b) Find the average y coordinate along the path c. 


Show that the path integral of afunction f(x, y) overa 


path C given by the graph of y = g(x),a <x <b is 
given by: 


b 
[re F(x, glx) /1 + [gx] dx 
E a 


Conclude that if g : [a, b] —> R is piecewise 
continuously differentiable, then the length of the graph 


21. 


22. 


23. 


24, 


25. 


26. 


of g on [a, b] is given by: 


b 
[i=] y 1 + g'(x)? dx. 
C a 


If g: [a, b] + R is piecewise continuously 
differentiable, let the length of the graph of g on [a, b] 
be defined as the length of the path t > (t, g(t)) for 

t e [a, b]. Show that the length of the graph of g on 
[a, blis 


b 
I V1 + [go ax. 
a 


Use Exercise 21 to find the length of the graph of 
y =logx from x = 1 tox =2. 


Use Exercise 20 to evaluate the path integral of 
f(x, y) = y over the graph of the semicircle 
y=V1—x2,-1l<x <1. 


Compute the path integral of f(x, y) = y? over the 
graph y = e*,0<x <1. 


Compute the path integral of f(x, y, z) = xyz over the 
path c(t) = (cost, sint, t), 0 <t < 5. 


Find the mass of a wire formed by the intersection of the 
sphere x? + y? + z? = land the plane x + y +z = 0 if 
the density at (x, y, z) is given by p(x, y, z) = x? grams 
per unit length of wire. 
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27. Evaluate Ie f ds, where f(x, y, z) =z and Galileo contemplated the following question: Does a 
c(t) = (t cost, t sint, t) forO <t < to. bead falling under the influence of gravity from a point 
A to a point B along a curve do so in the least possible 
28. Write the following limitas a path integral of time if that curve is a circular arc? For any given path, 
f(x, y, z) = xy over some path c on [0, 1] and evaluate: the time of transit T is a path integral 
N-1 T dt 
imi 2/42 2 =J eee 
dim Da (tité) v 
(= 


where the bead’s velocity is v = ./2gy, where g is the 
gravitational constant. In 1697, J ohann Bernoulli 
challenged the mathematical world to find the path in 
which the bead would roll from A to B in the least time. 
This solution would determine whether Galileo's 
considerations had been correct. 


where tı, ..., ty iS a partition of [0, 1]. 


29. Consider paths that connect the points A = (0, 1) and 
B = (1, 0) inthe xy plane, as in Figure 7.1.5.2 


(a) Calculate T for the straight-line path y = 1 — x. 


(b) Write a formula for T for Galileo's circular path, 
given by (x —1)2+(y - 1)? = 1. 


a &(0,1) 


Incidentally, Newton was the first to send his solution 
[which turned out to be a cycloid— the same curve 
Circular path (inverted) that we studied in Section 2.4, Example 4], 
Path but he did so anonymously. Bernoulli was not fooled, 
fo however. W hen he received the solution, he immediately 
knew its author, exclaiming, “I know the Lion from his 
paw.” While the solution of this problem is a cycloid, it 
> X is known in the literature as the brachistrochrone. This 
was the beginning of the important field called the 
calculus of variations. 


B 


figure 7.1.5 A curve joining the points Aand B. 


7.2 Line Integrals 


We now consider the problem of integrating a vector field along a path. We will begin 
by considering the notion of work to motivate the general definition. 


Work Done by Force Fields 


If F is a force field in space, then a test particle (for example, a small unit charge in an 
electric force field or a unit mass in a gravitational field) will experience the force F. 
Suppose the particle moves along the image of a path c while being acted upon by F. A 
fundamental concept is the work done by F on the particle as it traces out the path c. If 
cis a straight-line displacement given by the vector d and if F is a constant force, then 
the work done by F in moving the particle along the path is the dot product F - d: 


F -d = (magnitude of force) x (displacement in direction of force). 


If the path is curved, we can imagine that it is made up of a succession of infinitesimal 
straight-line displacements or that it is approximated by a finite number of straight-line 
displacements. Then (as in our derivation of the formulas for the path integral in the 
preceding section) we are led to the following formula for the work done by the force 


2We thank Tanya L eise for suggesting this exercise. 
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c(b) 


figure 7.2.1 For small yooo 


At, AS = C+ Af)—e(f) x e At. 


e oe òo = Žž o As = c(t + At) — c(t) 
a t t+At b 


x 


field F on a particle moving along a path c: [a, b] > R?: 
b 
work done by F =] F(e(t)) - c'(t) dt. 
a 


We can further justify this derivation as follows. As t ranges over a small interval t 
to t + At, the particle moves from c(t) to e(t + At), a vector displacement of As = 
c(t + At) — c(t) (see Figure 7.2.1). 

From the definition of the derivative, we get the approximation As ~ c(t) At. The 
work done in going from c(t) to c(t + At) is therefore approximately 


F(c(t))- As ~ F(c¢(t)) +e (t) At. 
If we subdivide the interval [a, b] into n equal parts a = tọ < ti < --- < tn = b, with 
At = t)41 — ti, then the work done by F is approximately 
n-1 n-1 
X F(c(t;))-As~ XO F(c(t))) elt) At. 
i=0 i=0 


Asn — œ, this approximation becomes better and better, and so it is reasonable to 
take as our definition of work to be the limit of the sum just given asn — oo. This limit 
is given by the integral 


b 
f F(c(t)) - c(t) dt. 


Definition of the Line Integral 
The previous discussion of work motivates the following definition. 


Definition Line Integrals Let F be a vector field on R? that is continuous on 
the C 1 path c: [a, b] > R?. We define f, F - ds, the line integral of F along c, by 


the formula 
b 
[F-as= | F(c(t)) -c'(t) dt; 


that is, we integrate the dot product of F with c’ over the interval [a, b]. 
Asis the case with scalar functions, we can also define f. F -dsif F(¢(t)) -¢(t) 
is only piecewise continuous. 
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example | 


solution 


For paths c that satisfy c'(t) 4 0, there is another useful formula for the line integral: 
namely, if T(t) = ¢’(t) /||¢’(t) || denotes the unit tangent vector, we have 


b 
Jre F(c(t)) -¢'(t) dt (by definition) 
b 


ES ct) J] es aes 
= f [Feet oglen (canceling je) (1) 


b 
af [F(c(t)) -T(t)]Ile'(t)|I dt. 


This formula says that f. F - dsis equal to something that looks like the path integral of 
the tangential component F(c(t)) - T(t) of F along c. In fact, the last part of formula (1) 
is analogous to the path integral of a scalar function f along c.? 

To compute a line integral in any particular case, we can either use the original defi- 
nition or integrate the tangential component of F along c, as prescribed by formula (1), 
whichever is easier or more appropriate. 


Let e(t) = (sint, cost, t) with 0 < t < 27. Let the vector field F be defined by 
F(x, y, z) = xi + yj + zk. Compute fF - ds. 


Here, F(c(t)) =F(sint, cost, t)=(sint)i+(cost)j+tk, and c¢(t)=(cost)i — 
(sint)j + k. Therefore, 


F(c(t))-c(t) = sint cost — cost sint +t =t, 


20 
[F-ds= f tdt = 27. 
c 0 


A nother common way of writing line integrals is 


[Feds [Fide Fady + Fade 
c c 


and so 


where F1, Fz, and F3 are the components of the vector field F. We call the expression 
Fı dx + F, dy + F3dz a differential form.’ By definition, the integral of a differential 
form along a path c, where e(t) = (x(t), y(t), z(t)), is 


b A- dx dy dz 
[fidx+ Fody+Faaz = (+rep + Fog) at= [Feds 


Note that we may think of ds as the differential form ds = dxi + dyj + dzk. Thus, 
the differential form Fıdx + F2dy + F3dz may be written as the dot product F - ds. 


3]f c does not intersect itself [thatis, if ¢(t,) = ¢(ty) implies tı = ty], then each point P on C (the image 
curve of c) can be written uniquely as c(t) for somet. If we define f (P) = f (c(t)) = F(c)-T(t), f is 
a function on C ; by definition, its path integral along ¢ is given by formula (1) and there is no difficulty 
in literally interpreting a F -dsasapath integral. If c intersects itself, we cannot define f as a function 
onC as before (why?); however, in this case it is still useful to think of the right side of formula (1) as 
a path integral. 


4See Section 8.5 for a brief discussion of the general theory of differential forms. 


example 2 


solution 


example 3 


solution 


example 4 


solution 


7.2 Line Integrals 


Evaluate the line integral 
fexta dz, 
c 
where c: [0, 1] + R? is given by e(t) = (t, t?, 1) = (x(t), y(t), z(t)). 
We compute dx/dt = 1, dy/dt = 2t, dz/dt = 0; therefore, 


1 
[Patona = [ (oor Fn) 


i is. 2. i 
= | (t?+2t*)dt= | ot? + Et] = =. 
f (t* + 2t") d E + zt | I5 


Evaluate the line integral 
[coseax + e* dy + €” dz, 
c 


where the path c is defined by c(t) = (1, t, et) and 0 < t < 2. 
We compute dx/dt = 0, dy/dt = 1, dz/dt = et, and so 


2 
foosza+edy+e"dz= f (0+ e+e”) dt 
c 0 


Lal 1 1 
= et + 3e” = 2e+ eei, 
2 0 


7O 7 


Let c be the path 

In 

2 
(see Figure 7.2.2). Evaluate the integral J(sin z dx + cosz dy — (xy)*/3 dz). 


x=cos?o, y=sineé, z=6, 0<0< 


In this case, we have 


dx = 2 R dy 2 Z _ 
T osina, dg =n 0 cosé, Jg 7t 
so the integral is 

[sinzax + coszdy — (xy)! dz 


c 


Tx /2 
= (—3 cos? @ sin? 6 + 3 sin? 6 cos? 6 — cos 6 sing) dé. 
0 
The first two terms cancel, and so we get 


77/2 1 
-f cosé sin ado = -|3 sm0 
0 


Tx /2 1 
e a 


0 
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figure 7.2.2 The image of the path 
x= cos’ 6, y= sin? 6, Z=0;0<06 <7n/2. 


A 


example 5 | Suppose F is the force vector field F(x, y, z) = x3i+ yj + zk. Parametrize the circle 
of radius a in the yz plane by letting ¢(@) have components 


x =0, y =acosd, z=asind, 0 <6 <2n. 


Because F(c(@)) -¢’(@) = 0, the force field F is normal to the circle at every point on 
the circle, so F will not do any work on a particle moving along the circle (Figure 7.2.3). 


figure 7.2.3 A vector field F normal to a circle in the 
y Xz plane. 


We can verify by direct computation that the work done by F is zero: 


W = [Feds= [xdx + ydy + zdz 
c c 


2x 
=f (0 — a? coso sing + a? coso sing) de = 0. 
0 


example 6 


example 7 
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If we consider the field and curve of Example 4, we see that the work done by the 
field is —ż4, a negative quantity. This means that the field impedes movement along the 
path. A 


Reparametrizations 


The line integral f F-ds depends not only on the field F but also on the path 
c: [a,b] — R°. In general, if cı and cz are two different paths in R?, Ja F- ds # 
Je, F + ds. On the other hand, we shall see that it is true that f, F -ds = + f, F-dsfor 
every vector field F if cı is what we call a reparametrization of c2; roughly speaking, 
this means that cı and cz are different descriptions of the same geometric curve. 


Definition Leth: | —> lı beaC t real-valued function that is a one-to-one map 
of an interval | = [a, b] onto another interval 1, = [aj, bı]. Let c: lı —> R? bea 
piecewise C ! path. Then we call the composition 


p=coh:| > R? 


a reparametrization of c. 


This means that p(t) = ¢(h(t)), and so h changes the variable; alternatively, we 
can think of h as changing the speed at which a point moves along the path. Indeed, 
observe that p’(t) = ¢’(h(t))h’(t), so that the velocity vector for p equals that for c but 
is multiplied by the scalar factor h’(t). 

Itis implicit in the definition that h must carry endpoints to endpoints; that is, either 
h(a) = a, and h(b) = bı, or h(a) = by and h(b) = aj. We thus distinguish two types 
of reparametrizations. If € o h is a reparametrization of c, then either 


(coh)(a) = c(a1) and (coh)(b) = c(bı) 
or 


(coh)(a) = c(b1) and (coh)(b) = c(aı). 


In the first case, the reparametrization is said to be orientation-preserving, and a particle 
tracing the path c o h moves in the same direction as a particle tracing c. In the second 
case, the reparametrization is described as orientation-reversing, and a particle tracing 
the path coh moves in the opposite direction to that of a particle tracing ¢ (Figure 7.2.4). 

For example, if C is the image of a path c, as shown in Figure 7.2.5— that is, C = 
c([aı, bı])— and if h is orientation-preserving, then co h(t) will go from c(a) to ¢(b3) 
as t goes from a to b; and if h is orientation-reversing, c o h(t) will go from c(bı) to 
c(aı) as t goes from a to b. 


Letc: [a, b] — R? bea piecewise C ! path. Then: 


(a) The path cop: [a, b] > R?, t > e(a + b — t), is areparametrization of c 
corresponding to the map h: [a, b] — [a, b], t > a +b — t; we call Cop the 
opposite path to c. This reparametrization is orientation-reversing. 


(b) The path p: [0, 1] > R?, t + c(a + (b — a)t), is an orientation-preserving 
reparametrization of c corresponding to a change of coordinates 
h: [0,1] > [a, b],t =a + (b —a)t. A 


364 Integrals Over Paths and Surfaces 


figure 7.2.4 Illustrating (a) an 
orientation-preserving 
reparametrization, and (b) an 


orientation-reversing Graph of h Graph of h 
reparametrization. a e——_______e, aie eh 
h \ Pn 
a; e—a h a; e—a D 
h preserves orientation h reverses orientation 
(a) (b) 


p(b)=c(b;) 


eh) = p(t) 


figure 7.2.5 The path p = co his 
a reparametrization of ©. 


p(a) = ¢(a,) 


a 


C a. s iI 03. En” wr OD 


a t b ay h(t) bı 


Theorem 1 Change of Parametrization for Line Integrals Let F bea 
vector field continuous on the C! path c: [a;, bı] > R?, and let p: [a, b] > R? 
be a reparametrization of c. If p is orientation-preserving, then 


fF-as= [Fas 
p c 


and if p is orientation-reversing, then 


[Fds=— [Fas 
p c 
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proof By hypothesis, we havea map h such that p = coh. By the chain rule, 
and so 


b 
[Feas= f [F(e(h(t))) -e(h(t))]h’(t) dt. 
p a 


Changing variables with s = h(t), this becomes 


F(c(s))-c'(s)ds= / F-ds if p is orientation- 
a £ preserving 
a1 
| F(c(s))-c(s)ds=— /F-ds if pisorientation- 
bı i reversing. E 


Theorem 1 also holds for piecewise C ? paths, as may be seen by breaking up the 
intervals into segments on which the paths are of class C t and summing the integrals 
over the separate intervals. 

Thus, if it is convenient to reparametrize a path when evaluating an integral, Theo- 
rem 1 assures us that the value of the integral will not be affected, except possibly for 
the sign, depending on the orientation. 


example 8 | LetF(x, y,z) = yzi+xzj + xyk and c: [—5, 10] — R? be defined by t + (t, t?, t?). 
Evaluate f, F -dsand fe, F-ds. 


solution | Forthepathc, wehavedx/dt=1, dy/dt=2t, dz/dt=3t?, and F(e(t)) = tři+ttj+t?k. 
Therefore, 


10 /_ dx dy dz 
fras. (Hg shh tag) at 


10 
= (t5 + 2t5 + 3t5) dt = [t9]!2, = 984,375. 
5 


On the other hand, for 


Cop: [—5, 10] > R?, tre e(5 — t) = (5 — t, (5 — t)?, (5 —t)?), 


we have dx/dt = —1, dy/dt = —10 + 2t = —2(5 — t), dz/dt = —75 + 30t — 3t? = 
—3(5 — t)?, and F(¢op(t)) = (5 — t)*i + (5 — t)4j + (5 — t)?k. Therefore, 


10 
f F-ds= [—(5 = t)? — 2(5 — t)? — 3(5 — t)°]dt = [(5 — t)ê]1% = —984, 375. 
Cop -5 A 


We are interested in reparametrizations, because if the image of a particular € can be 
represented in many ways, we want to be sure that path and line integrals depend only 
on the image curve and not on the particular parametrization. For example, for some 
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problems the unit circle may be conveniently represented by the map p given by 
x(t) = cos2t, y(t) = sin2t, O0<t<z. 


Theorem 1 guarantees that any integral computed for this representation will be the 
same as when we represent the circle by the map c given by 


x(t) = cost, y(t) = sint, 0<t <2n, 


because p = €c o h, where h(t) = 2t, and thus p is an orientation-preserving repara- 
metrization of c. However, notice that the map y given by 


y(t) = (cost, sint), 0 <t< 4r 


is not a reparametrization of c. Although it traces out the same image (the circle), it 
does so twice. (Why does this imply that y is not a reparametrization of c?) 

The line integral is an oriented integral, in that a change of sign occurs (as we have 
seen in Theorem 1) if the orientation of the curve is reversed. The path integral does not 
havethis property. This follows from the fact that changing t to —t (reversing orientation) 
just changes the sign of c’(t), not its length. This is one of the differences between line 
and path integrals. The following theorem, which is proved by the same method as in 
Theorem 1, shows that path integrals are unchanged under reparametrizations— even 
orientation-reversing ones. 


Theorem 2 Change of Parametrization for Path Integrals Let c be 
piecewise C t, let f be a continuous (real-valued) function on the image of c, and 
let p be any reparametrization of c. Then 


[toy 2)és= | F(x, z) ds. (2) 
c p 


Line Integrals of Gradient Fields 


We next consider a useful technique for evaluating certain types of line integrals. Recall 
that a vector field F is a gradient vector field if F = Vf for some real-valued function 
f. Thus, 


TEREPEN 
əx əy” az 


Suppose g and G are real-valued continuous functions defined on a closed interval 
[a, b], that G is differentiable on (a,b), and that G’ = g. Then by the fundamental 
theorem of calculus 


b 
/ g(x) dx = G (b) — G (a). 


Thus, the value of the integral of g depends only on the value of G at the endpoints 
of the interval [a, b]. Because Vf represents the derivative of f, we can ask whether 


example 9 


solution 
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J. Vf -dsis completely determined by the value of f at the endpoints c(a) and ¢(b). 
The answer is contained in the following generalization of the fundamental theorem of 
calculus. 


Theorem 3 Line Integrals of Gradient Vector Fields Suppose f: R? > R 
is of class C+ and that c: [a, b] > R? is a piecewise C t path. Then 


[x .ds= f(e(b)) — f(e(a)). 


c 


proof Apply the chain rule to the composite function 
F: t f(c(t)) 
to obtain 
F(t) = (f o c)'(t) = Vf (c(t)) -c'(t). 


The function F is a real-valued function of the variable t, and so, by the fundamental 
theorem of single-variable calculus, 


b 
f F(t) dt = F (b) — F (a) = f(elb)) — f (ela)). 


Therefore, 


b b 
[vt-as= f Vre e t= f F'(t)dt = F (b) — F (a) 
= f(c(b)) — f (c(a)). m 


Letc bethe path e(t) = (t*/4, sin?(tz/2), 0), t € [0, 1]. Evaluate 


fys 
c 


(which means f, y dx + x dy + 0 dz). 


We recognize y dx + x dy, or, equivalently, the vector field yi + xj + Ok, as the gradient 
of the function f(x, y, z) = xy. Thus, 


1 1 

[yex+xay= fn) - fle())=5-1-0=5. | 
Obviously, if we can recognize the integrand as a gradient, then evaluation of the 
integral becomes much easier. For example, try to work out the integral in Example 9 
directly. In one-variable calculus, every integral is, in principle, obtainable by finding an 
antiderivative. For vector fields, however, this is not always true, because a given vector 
field need not be a gradient. This point will be examined in detail in Section 8.3, where 


368 Integrals Over Paths and Surfaces 


figure 7.2.6 A simple curve that 

has no self-intersections is shown 

on the left. On the right is a curve 
with a self-intersection, so it is not 
simple. 


figure 7.2.7 There 
are two possible 
senses of direction 
on a curve joining 
P and Q. 


figure 7.2.8 A simple closed 
curve (left) and a closed curve 
that is not simple (right). 


e(b) e(b) 


Simple curve Not a simple curve 


we derive a test to determine when a vector field F is a gradient; that is, when F = Vf 
for some f. 


Line Integrals Over Geometric Curves 


Wehave seen how to define path integrals (integrals of scalar functions) and line integrals 
(integrals of vector functions) over parametrized curves. We havealso seen that our work 
is simplified if we make a judicious choice of parametrization. B ecause these integrals 
are independent of the parametrization (except possibly for the sign), it seems natural 
to express the theory in a way that is independent of the parametrization, and that is 
thereby more “geometric.” We do this briefly and somewhat informally in the following 
discussion. 


Definition We define a simple curve C to be the image of a piecewise C t map 
c: | — R that is one-to-one on an interval |; ¢ is called a parametrization of 
C. Thus, a simple curve is one that does not intersect itself (Figure 7.2.6). If 
| = [a, b], we call c(a) and c(b) endpoints of the curve. 

Each simple curve C has two orientations or directions associated with it. If P 
and Q are the endpoints of the curve, then we can consider C as directed either 
from P to Q or from Q to P. The simple curve C together with a sense of direction 
is called an oriented simple curve or directed simple curve (Figure 7.2.7). 


Definition Simple Closed Curves By a simple closed curve we mean the 
image of a piecewise C! map c: [a, b] — IR? that is one-to-one on [a, b) and 
satisfies c(a) = c(b) (Figure 7.2.8). If c satisfies the condition c(a) = c(b), but 
is not necessarily one-to-one on [a, b), we call its image a closed curve. Simple 
closed curves have two orientations, corresponding to the two possible directions 
of motion along the curve (Figure 7.2.9). 


If C is an oriented simple curve or an oriented simple closed curve, we can unam- 
biguously define line integrals along them. 


c(a) = e(b) e(a) = e(b) 
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C. C 
figure 7.2.9 Two possible orientations 
for a simple closed curve C. 


Line Integrals and Path Integrals Over Oriented Simple Curves 
and Simple Closed Curves C: 


a ane [ tas= [fs (3) 
C c i c 


where c is any orientation-preserving parametrization of C. 


These integrals do not depend on the choice of c as long as c is one-to-one (except 
possibly at the endpoints) by virtue of Theorems 1 and 2.° The point we want to make 
hereis that, although a curve must be parametrized to makeintegration along ittractable, 
itis not necessary to include the parametrization in our notation for the integral. 


If | = [a, b] is a closed interval on the x axis, then | , as a curve, has two orientations: 
one corresponding to motion from a to b (left to right) and the other corresponding to 
motion from b to a (right to left). If f is a real-valued function continuous on |, then 
denoting | with the first orientation by | * and | with the second orientation by |~, we 
have 


b a 
' food = f Fix)dx=— f Fixddx=— f tod a 


A given simple closed curve can be parametrized in many different ways. Fig- 
ure 7.2.10 shows C represented as the image of a map p, with p(t) progressing in 
a prescribed direction around an oriented curve C as t ranges from a to b. Note that 
p’(t) points in this direction also. The speed with which we traverse C may vary from 
parametrization to parametrization, but as long as the orientation is preserved, the inte- 
gral will not, according to Theorems 1 and 2. 

The following precaution should be noted in regard to these remarks. It is possible 
to have two mappings c and p with the same image, and inducing the same orientation 


on the image, such that 
[Fos [reas 
c p 


>We have not proved that any two one-to-one paths ¢ and p with the same image must be reparametriza- 
tions of each other, but this technical point will be omitted. 
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figure 7.2.11 A curve can 
be made up of several 
components. 


P(t) 
p(t) 


C figure 7.2.10 As t goes from ato b, 
p = pb P p(t) moves around the curve Cin 
G Pt?) PtH) some fixed direction. 


P(t) P'o) 
( n e a S a 


att t hb 


For an example, let c(t) = (cost, sint, 0) and p(t) = (cos2t, sin 2t, 0),0 <t <2 
with F(x, y,z) =(y, 0, 0). Then F(x, y, z)=y, Fa(x, y, 2) =0, and F3(x, y, z)= 


SO 
2x 2n 
[F-s= f F (ett) dt =- | ai ke ae 
c 0 dt 0 


But JF -ds = —2 fr sin’? 2tdt = —2x. Clearly, c and p have the same image, 
namely, the unit circle in the xy plane. M oreover, they traverse the unit circle in the same 
direction; yet f F -ds + JF - ds. The reason for this is that ¢ is one-to-one, but p is 
not (p traverses the unit circle twice in a counterclockwise direction); therefore, p is not 
a parametrization of the unit circle as a simple closed curve. 

As a consequence of Theorem 1 and generalizing the notation in Example 10, we 
introduce the following convention: 


Line Integrals Over Curves with Opposite Orientations LetC~ be 
the same curve as C , but with the opposite orientation. Then 


[F-ds=- F-ds. 
G Gs 


We also have: 


Line Integrals Over Curves Consisting of Several Components Let 
C be an oriented curve that is made up of several oriented component curves 
Ci,i =1,...,k, asin Figure 7.2.11. Then weshall writeC =C,+C2+---+Cx. 
Because we can parametrizeC by parametrizing the pieces Ci, ..., C x separately, 
we can prove that 


| e= F-ds+ / F-ds+---+ / F-ds. (4) 
£ Cai (Ca Ck 


One reason for writing a curve as a sum of components is that it may be easier to 
parametrize the components C; individually than it is to parametrize C as a whole. If 
that is the case, formula (4) provides a convenient way of evaluating fe F- ds. 


example 11 


solution 
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The ar Notation for Line Integrals 


Sometimes one writes, as we occasionally do later, the line integral using the notation 


[Foe 
C 


The reason is that we think of describing a C* path c in terms of a moving position 
vector based at the origin and ending at the point c(t) at time t. Position vectors are 
often denoted by r= xi + yj + zk, and so the curve is described using the notation 
r(t) =x(t)i+ y(t)j + z(t)k in place of c(t). By definition, the line integral is given by 


b 
/ F(r(t)) Sr dt 


Formally canceling the dt’s, and using the parametrization independence to replace the 
limits of integration with the geometric curve C, we arrive at the notation fẹ F-dr. 


Consider C , the perimeter of the unit square in R?, oriented in the counterclockwise 
sense (see Figure 7.2.12). Evaluate the line integral 


[Latya 
(0 


figure 7.2.12 The perimeter of the unit square, 
parametrized in four pieces. 


(0, 0) 


We evaluate the integral using a convenient parametrization of C that induces the given 
orientation. For example: 


(t, 0) 0<t<l 
(1,t=1) 1<t<2 
(3—t, 1) 2<t <3 
(0,4 —t) 3<t<4. 


c [0,4 > R, te 


Then 
il 2 
2 = 2 = 
f» ax+aydy= f tt +odt+ f t04 1)]dt 
3 4 
—(3 =t)? +0] dt 0 + 0) dt 
+f (3— t)? +0] +f +0) 


a het a eee ee 
3 2 3 72 
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Now let us reevaluate this line integral, using formula (4) and parametrizing the C; 
separately. Notice that C = Cı + C2 +C3 + C4, where C; are the oriented curves 
pictured in Figure 7.2.12. These can be parametrized as follows: 


Ci: q(t) =(t,0),0<t<1 
Co: &(t) =(1,t),0<t<1 
C3: ¢3(t) =(1-t,1),0<t<l 
C4: c(t) =(0,1—-t),0<t <1, 
and so 
[ ex+wa= f t? dt = 
Ci 0 3 
2 z 1 
1: ax-+xydy= f tdt = = 
Co 0 2 
2 i 2 1 
Je dx-+xydy= f —~(1—t)*dt=—= 
C3 3 
f eacrryay= [ost =o. 
C4 0 
Thus, again, 
1 1 a 1 
2 = = 
E dx + xydy = 5 +5 3 +0=>5- P 


Aninteresting application of thelineintegral isthe mathematical formulation of A mpère’s 
law, which relates electric currents to their magnetic effects. Suppose H denotes a mag- 
netic field in R?, and let C be a closed oriented curve in R?. In appropriate physical 
units, A mpeére’s law states that 


[Heise 1, 
C 


where | is the net current that passes through any surface bounded by C (see 
Figure 7.2.13). 


figure 7.2.13 The magnetic field H 
surrounding a wire carrying a current / 
satisfies Ampére’s law: fH -ds=/. 


Current 7 


®The discovery that electric currents produce magnetic effects was made by Haas Christian Oersted 
circa 1820. See any elementary physics text for discussions of the physical basis of these ideas. 
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Finally, let us mention that the line integral has another important physical meaning, 
specifically, the interpretation of fe V -ds as circulation, where V is the velocity field 
of a fluid, as we shall discuss in Section 8.2. Thus, a wide variety of physical concepts, 
from the notion of work to electromagnetic fields and the motions of fluids, can be 
analyzed with the help of line integrals. 


exercises 


1. Evaluate the line integral 


pres 
c 


where F(x, y) = y?i — xyj and C is the part of the circle 
x2 + y? = 1 that starts at (1, 0) and ends at (0, 1), 
oriented counterclockwise. 


. Repeat Problem 1 for F = yi + 2xyj, where C is the 
entire unit circle x2 + y? = 1. 


a LetF(x, y,z) = xi + yj +zk. Evaluate the integral of F 
along each of the following paths: 

(a) c(t) = (t,t, t), 0<t<l 

(b) e(t) = (cost, sint, 0), 0<t<2r 

(c) e(t) = (sint, 0, cost), 

(d) elt) = (t2, 3t, 2t3), 


0<t<2r 
-l<t<2 


. Evaluate each of the following line integrals: 
(a) fex dy —y dx, 


0<t<2z 


(b) f.xdx+y dy, 


0<t<2 


c(t) = (cost, sint), 
c(t) = (coszt, sin zt), 


(c) Ie yz dx + xz dy + xy dz, where c consists of 
straight-line segments joining (1, 0, 0) to (0, 1, 0) to 
(0, 0, 1) 

(d) f x? dx — xy dy + dz, where c is the parabola 
z = x2, y = 0 from (—1, 0, 1) to (1, 0, 1). 


. Consider the force field F(x, y, z) = xi + yj + zk. 
Compute the work done in moving a particle along the 
parabola y = x?, z = 0, from x = —1 to x = 2. 


» Let c be a smooth path. 
(a) Suppose F is perpendicular to c'(t) at the point c(t). 


Show that 
fF -ds=0. 
C 


10. 


(b) If F is parallel to c'(t) at e(t), show that 


JF- as= [rus 
c c 


[By parallel to c'(t) we mean that 
F(c(t)) = a(t)e’(t), where a(t) > 0.] 


. Suppose the path c has length |, and ||F|| < M . Prove 


that 


| [Fas] =mi. 
€ 


» Evaluate JF- ds, where F(x, y, z) = yi + 2xj + yk 


and the path c is defined by e(t) = ti + t4j + t?k, 
0<t<1. 


. Evaluate 


[y+ -xiy + zaz 
ç 


for each of the paths e(t) = (t, t”, 0), 0 < t < 1, where 
DEE 2, 3, ans 


This exercise refers to Example 12. Let L be avery long 
wire, a planar section of which (with the plane 
perpendicular to the wire) is shown in Figure 7.2.14. 


y 


figure 7.2.14 A planar section of a long wire 
and a curve Cabout the wire. 
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15. 


16. 
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Suppose this plane is the xy plane. Experiments show 
that H is tangent to every circle in the xy plane whose 
center is the axis of L , and that the magnitude of H is 
constant on every such circle C . Thus, H = H T, where 
T isa unit tangent vector to C and H is some scalar. 
Using this information, show that H = | /2zr, where r 
is the radius of circle C and | is the current flowing in 
the wire. 


The image of the path t —> (cos? t, sin? t), 0 < t < 27r 
in the plane is shown in Figure 7.2.15. Evaluate the 
integral of the vector field F(x, y) = xi + yj around this 
curve. 


0 D=e(3) 


(1, 0) =e) (1, 0) = e(0) = e(27) 


0-p=(8) 


figure 7.2.15 The hypocycloid e(f) = (cos? t, sin? A 
(Exercise 11). 


Suppose cj and cz are two paths with the same endpoints 
and F is a vector field. Show that f, F- ds= fẹ F- ds 
is equivalent to f. F- ds = 0, where C is the closed 
curve obtained by first moving along cı and then moving 
along C2 in the opposite direction. 


Let c(t) be a path and T the unit tangent vector. W hat is 
fT- ds 
c 


Let F = (z3 + 2xy)i + x2j + 3xz?k. Show that the 
integral of F around the circumference of the unit square 
with vertices (+1, +1) is zero. 


Using the path in Exercise 11, observe that a C 1 map 
c: [a, b] > R? can have an image that does not “look 
smooth.” Do you think this could happen if ¢’(t) were 
always nonzero? 


W hat is the value of the integral of a gradient field 
around a closed curve C ? 


17. 


18. 


Evaluate the line integral 
f 2xyz dx + x2z dy + xy dz, 
c 
where C is an oriented simple curve connecting (1, 1, 1) 
to (1, 2, 4). 


Suppose Vf (x, y, z) = 2xyze¥*i + ze%*j + ye*’k. If 
f (0, 0,0) = 5, find f (1, 1, 2). 


19. Consider the gravitational force field (with 


20. 


21. 


G =m =M = 1) defined [for (x, y, z) + (0, 0, 0)] by 


1 
(xX? + y? + z?)3/2 


F(x, y,z) = (xi+ yj + zk). 
Show that the work done by the gravitational force as a 
particle moves from (x1, yi, 21) to (x2, Y2, Z2) along any 


path depends only on the radii Rı = \/xf + yf +z? 
and R2 = «/x} + y +23. 


A cyclist rides up a mountain along the path shown in 
Figure 7.2.16. She makes one complete revolution 
around the mountain in reaching the top, while her 
vertical rate of climb is constant. Throughout the trip she 
exerts a force described by the vector field 


F(x, y,z) = yi+xj +k. 


What is the work done by the cyclist in traveling from 
A to B? What is unrealistic about this model of a cyclist? 


figure 7.2.16 How much work is done in 
cycling up this mountain? 


Letc: [a, b] + R? bea path such that c'(t) + 0. Recall 
from Section 4.1 that when this condition holds, c is said 
to be regular. Let the function f be defined by the 
formula f(x) = f |le’(t)|j dt. 


(a) What is df /dx? 


(b) Using the answer to part (a), prove that 
f: [a, b] —> [0, L], where L is the length of c, has a 


22. 


differentiable inverse g: [0, L] — [a, b] satisfying 
f og(s) =S, go f(x) =x. (You may use the 
one-variable inverse function theorem stated at the 
beginning of Section 3.5.) 

Compute dg /ds. 


Recall that a path s + b(s) is said to be of unit 


(c 
(d 


speed, or parametrized by arc length, if ||b’(s) |] = 1. 


Show that the reparametrization of ¢ given by 

b(s) = c o g(s) is of unit speed. Conclude that any 
regular path can be reparametrized by arc length. 
(Thus, for example, the Frenet formulas in Exercise 
23 of Section 4.2 can be applied to the 
reparametrization b.) 


Along a “thermodynamic path” C in (V,T, P ) space, 


(i) The heat gained is fe Av dV + Ky dT, where 
Ay, Ky are functions of (V, T, P ), depending on 
the particular physical system. 


(ii) The work doneis fo P dV. 


For avan der Waals gas, we have 


T a RT 
] Av = 5—7 


R 
PWT =< TT 


Vb V2’ 
and Ky = constant, 


where R, b,a, and } are known constants. Initially, the 
gas is at a temperature To and volume Vo. 
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(a) An adiabatic process is a thermodynamic motion 
(V (t), T(t), P (t)) for which 


dT dT/dt Av 
dV dv/dt Ky’ 


If the van der Waals gas undergoes an adiabatic 
process in which the volume doubles to 2Vo, 
compute 

(1) the heat gained; 

(2) the work done; and 

(3) the final volume, temperature, and pressure. 


(b) After the process indicated in part (a), the gas is 
cooled (or heated) at constant volume until the 
original temperature To is achieved. Compute 
(1) the heat gained; 

(2) the work done; and 
(3) the final volume, temperature, and pressure. 


(c) After the process indicated in part (b), the gas is 
compressed until the gas returns to its original 
volume Vo. The temperature is held constant 
throughout the process. Compute 

(1) the heat gained; 

(2) the work done; and 

(3) the final volume, temperature, and pressure. 
For the cyclic process described in parts (a), (b), (c), 
compute 

(1) the total heat gained; and 

(2) the total work done. 


(d 


7.3 Parametrized Surfaces 


In Sections 7.1 and 7.2, we studied integrals of scalar and vector functions along curves. 
Now we turn to integrals over surfaces and begin by studying the geometry of surfaces 
themselves. 


Graphs Are Too Restrictive 


We are already used to one kind of surface, namely, the graph of a function f(x, y). 
Graphs were extensively studied in Chapter 2, and we know how to compute their 
tangent planes. However, it would be unduly limiting to restrict ourselves to this case. 
For example, many surfaces arise as level surfaces of functions. Suppose our surface 
S is the set of points (x, y, z), where x — z+ z? = 0. Here S is a sheet that (relative 
to the xy plane) doubles back on itself (see Figure 7.3.1). Obviously, we want to call 
S a surface, because it is just a plane with a wrinkle. However, S is not the graph 
of some function z = f(x, y), because this means that for each (Xo, yo) € R? there 
must be one Zo with (Xo, Yo, Zo) € S. AS Figure 7.3.1 illustrates, this condition is 
violated. 

Another example is the torus, or surface of a doughnut, which is depicted in Fig- 
ure 7.3.2. Anyone would call a torus a surface; yet, by the same reasoning as before, a 


376 Integrals Over Paths and Surfaces 


figure 7.3.3 The first step in 
obtaining a torus from a 
rectangle is to make a cylinder. 


figure 7.3.4 Bend the cylinder 
and glue the ends to get a torus. 


re 


? 
l 

L figure 7.3.1 A surface that is not 
——~ the graph of a function z= f(x, y). 
! f 
b (Xo, Yo, 0) 


figure 7.3.2 The torus is not the graph of a function 
of the form z= f(x, y). 


x 


torus cannot be the graph of a differentiable function of two variables. These observa- 
tions encourage us to extend our definition of a surface. 

The motivation for the extended definition that follows is partly that a surface can 
be thought of as being obtained from the plane by “rolling,” “bending,” and “pushing.” 
For example, to get a torus, we take a portion of the plane and roll it (see Figure 7.3.3), 
then take the two “ends” and bring them together until they meet (Figure 7.3.4). 


Parametrized Surfaces as Mappings 


In our study of differential calculus we dealt with mappings f: A c R" > R™. Taking 
n = 2andm = 3 corresponds to the case of a two-dimensional surface in 3-space. With 
surfaces, just as with curves, we want to distinguish a map (a parametrization) from its 
image (a geometric object). This leads us to the following definition. 


— 
— Ee- = 


Ends glued 


figure 7.3.5 ® “twists” and 
“pends” D onto the surface 


S= (D). 


example | 
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x 


S=@(D) 


Definition Parametrized Surfaces A parametrization of asurfaceis a func- 
tion ®: D c R? — R?, where D is some domain in R?. The surface S corre- 
sponding to the function ® is its image: S = ®(D). We can write 


®(u, v) = (x(u, v), y(u, v), z(u, v)). 


If ® is differentiable or is of class C+ [which is the same as saying that x(u, v), 
y(u, v), and z(u, v) are differentiable or C+ functions of (u, v)], we call S a 
differentiable or a C + surface. 


We can think of ® as twisting or bending the region D in the plane to yield the 
surface S (see Figure 7.3.5). Thus, each point (u, v) in D becomes a label for a point 
(x(u, v), y(u, v), z(u, v)) on S. 

Of course, surfaces need not bend or twist at all. In fact, planes are flat, as shown in 
our first, and simplest, example. 


In Section 1.3 we studied the equation of a plane P . We did so in terms of graphs and 
level sets. Now we examine the same notion using a parametrization. 

Let P bea plane that is parallel to two vectors œ and £ and that passes through the 
tip of another vector y, asin Figure 7.3.6. 


Go Yo» Zo) 


x 


figure 7.3.6 Describing a plane parametrically. 
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figure 7.3.7 The tangent vectors 
T, and T, that are tangent to the 
curve on a surface S, and hence 
tangent to S. 


Our goal in this example is to find a parametrization of this plane. Notice that the vector 
æ x B = N, which we also write as Ai + Bj + C k, is normal to P . If the tip of y is the 
point (Xo, Yo, Zo), then the equation of P as a level set (as discussed in Section 1.3) is 
given by: 


A(X — Xo) + B(y — Yo) + C (Z — Zo) = 0. 


However, the set of all points on the plane P can also be described by the set of all 
vectors that are y plus a linear combination of œ and £. Using our preferred choice of 
real parameters u and v, we arrive at the parametric equation of the plane P : 


(u, v) =u +ßfv+y. A 


Tangent Vectors to Parametrized Surfaces 


Suppose that ® is a parametrized surface that is differentiable at (uo, vo) € R?. Fixing 
u at Up, we get a map R —> R? given by t ++ ®(Ug, t), whose image is a curve on the 
surface (Figure 7.3.7). From Chapters 2 and 4 we know that the vector tangent to this 
curve at the point ®(Uo, vo), which we denote by T,, is given by 


© ax 
a 


v 


. Oy . Z 
= — = — (Uo, Vo)i + — (Uo, — (Uo, vo)k. 
Ju Ju or vodi + a o vo)j + T o Uo) 
Similarly, if we fix v and consider the curvet +> ®(t, vo), we obtain the tangent vector 
to this curve at ®(Uo, vo), given by 


a® 0X . Oy . oZ 
= — = —(Up, vo)i + — (uo, — (Uo, vo)k. 
aU aa o Vo) a 0 volj + a o vo) 


u 


Regular Surfaces 


Because the vectors T, and T, are tangent to two curves on the surface at a given point, 
the vector Tu x T, ought to be normal to the surface at the same point. 

We say that the surface S is regular or smooth’ at ®(Uo, vo), provided that 
Tu x T, Æ Oat (Uo, vo). The surface is called regular if it is regular at all points 
®(Uo, vo) € S. Thenonzero vector Tu x T, is normal to S (recall that the vector product 


(uo, vo) BaN y T, 


u v = constant 
u = constant 


x 


7Strictly speaking, regularity depends on the parametrization © and not just on its image S. Therefore, 
this terminology is somewhat imprecise; however, it is descriptive and should not cause confusion. (See 
Exercise 19.) 


example 2 


solution 


x 


figure 7.3.8 The surface 


Z= 4/ X2 + y? is a cone. It is not 


regular at its tip. 
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of Tu and T, is perpendicular to the plane spanned by T, and T,); the fact that it is 
nonzero ensures that there will be a tangent plane. Intuitively, a smooth surface has no 
“corners.” 


Consider the surface given by the equations 
X =u CoSv, y=usinv, zZz=u, u>0. 
Is this surface differentiable? Is it regular? 


These equations describe the surface z = ,/x2 + y? (square the equations for x, y, and 
z to check this), which is shown in Figure 7.3.8. This surface is a cone with a “point” at 
(0, 0, 0); itis a differentiable surface because each component function is differentiable 
as a function of u and v. However, the surface is not regular at (0, 0, 0). To see this, 
compute T, and T, at (0, 0) € R?: 


u = 5r = pyle Oi + 5 (0, Of + (0, O)k = (cos 0)i + (sin O)j +k = i +k, 
and similarly, 
zE = = 0(—sin0)i + 0(cos0)j + 0k = 0. 


Thus, T, x T, = 0, and so, by definition, the surface is not regular at (0, 0, 0). A 


Tangent Plane to a Parametrized Surface 


We can use the fact that n = Tu x T, is normal to the surface to both formally define 
the tangent plane and to compute it. 


Definition The Tangent Plane to a Surface If a parametrized surface 
®:D c R? > R’ isregular at ®(Ug, vo)— thatis, if Tu xT, 4 Oat (uo, vo)— we 
define the tangent plane of the surface at ®(uo, vo) to be the plane determined by 
the vectors Tu and T,. Thus, n = Tu x T, is anormal vector, and an equation of 
the tangent plane at (Xo, Yo, Zo) on the surface is given by 


(x — Xo, Y — Yo. Z — Zo) -n = 0, (1) 


where n is evaluated at (Uo, vo); that is, the tangent plane is the set of (x, y, z) 
satisfying (1). If n = (nz, N2, n3) = Nai + n2j + nak, then formula (1) becomes 


nal = xo) Mot = Yo) F nle = Za)! = 0) (1’) 


Bin Section 3.5 we showed that level surfaces f(x, y, z) = 0 were in fact graphs of functions of two 
variables in some neighborhood of a point (Xo, Yo, Zo) satisfying V f (Xo, Yo, Zo) # 0. This united two 
concepts of a surface— graphs and level sets. A gain, using the implicit function theorem, itis likewise 
possible to show that the image of a parametrized surface ® in the neighborhood of a point (Uo, vo) 
where Tux T, Æ 0 is also the graph of a function of two variables. Thus, all definitions of a surface 
are consistent. (See Exercise 20.) 
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example 3 


solution 


example 4 


solution 


Let ®: R? — R? be given by 
X=ucosv, y=usinv,  Z=U +v’. 
W here does a tangent plane exist? Find the tangent plane at (1, 0). 
We compute 
T, =(cosv)it+(sinv)j+2uk and T,= —u(sinv)i+u(cosv)j + 2uk, 


so the tangent plane at the point ®(uo, vo) is the set of vectors through ®(uo, vo) 
perpendicular to 


(Ty x T,)(Uo, vo) = (—2u§ cos vo + 2vo SiN vo, —2U§ SiN vo — 2vo COS vo, Uo) 


if this vector is nonzero. Because T, x T, is equal to Oat (Uo, vo) = (0, 0), we cannot 
find a tangent plane at ®(0, 0) = (0, 0, 0). However, we can find an equation of the 
tangent plane at all the other points, whereT, x T, 40.Atthepoint (1, 0) =(1, 0, 1), 


n = (T, xT,)(1, 0) = (—2, 0, 1) = —2i + k. 


Because we have the vector n normal to the surface and a point (1, 0, 1) on the surface, 
we can use formula (1) to obtain an equation of the tangent plane: 


—2(x —1)+(z—1) = 0; thatis,z =2x- 1. A 


Supposea surface S isthe graph of a differentiablefunction g: R? —> R. Write S in para- 
metric form and show that the surface is smooth at all points (Uo, vo, g(Uo, vo)) € R°. 


Write S in parametric form as follows: 
x=u, y=v, z=g(u,v), 


which is the same as z = g(x, y). Then at the point (uo, vo), 
ð 
Ty ja Big vo) kK and T, =j + Stuy, vo)k, 
ðu dv 
and for (Ug, vo) € R2, 


. oO . 
Pee =o the sel — ye ERO. (2) 
ou Ov 


This is nonzero because the coefficient of k is 1; consequently, the parametrization 
(u, v) +> (u, v, g(u, v)) is regular at all points. M oreover, the tangent plane at the point 
(Xo, Yo, Zo) = (Uo, vo, J(Uo, vo)) is given, by formula (1), as 


əg əg _ 
(X — Xo, Y — Yo. Z 20) E, Z1) =0, 


where the partial derivatives are evaluated at (Uo, vo). Remembering that x = u and 
y = v, we can write this as 


Z eal x0) + (B)ty yor (3) 


where dg/dx and ag/ay are evaluated at (Xo, yo). A 


This example also shows that the definition of the tangent plane for parametrized 
surfaces agrees with the one for surfaces obtained as graphs, because equation (3) is 
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the same formula we derived (in Chapter 2) for the plane tangent to S at the point 
(Xo, Yo, Zo) € S. 

Itis also useful to consider piecewise smooth surfaces, that is, surfaces composed of 
a certain number of images of smooth parametrized surfaces. For example, the surface 
of a cube in IR? is such a surface. These surfaces are considered in Section 7.4. 


example 5 | Finda parametrization for the hyperboloid of one sheet: 


xX +y a7 = 1, 


solution | Because x and y appear in the combination x? + y?, the surface is invariant under 


rotation about the z axis, and so it is natural to write 


X =T cosé, y =r sinð. 


Then x? + y? — 2z? = 1 becomes r? — z? = 1. This we can conveniently parametrize by? 


r = cosh u, z=sinhu. 
Thus, a parametrization is 
x = (cosh u) (cos 0), y = (cosh u) (sin 0), z=sinhu, 


where 0 < 0 < 2m, =œ <U <œ. A 


exercises 


In Exercises 1 to 3, find an equation for the plane tangent to the given surface at the specified point. 


L x=2u, y=u?+v z=v%, at(0,1,1) 3. x=u?, y=usine’, z= tucose’, 
at (13, —2, 1) 
2.x=ue-v*, y=utv, z=u* +40, 
at (-4, Ji 2) 4. Atwhat points are the surfaces in Exercises 1 and 2 
regular? 


In Exercises 5 and 6, find all points (uo, vo), where S = ®(uoọ, vo) is not smooth (regular). 
5. (u, v) = (U? — v2, u2 + v%, v) 
6. (u, v) = (u — v, u + v, 2uv) 
7. Match the following parameterizations to the surfaces 


shown in the figures. 


(a) &(u, v) = ((2V1 + u2) cos v, (2V1 + U2) sin v, u) 
(b) ®(u, v) = (3 cosu sin v, 2 sinu sin v, cos v) 

(c) (u,v) = (u, v, u?) 

(d) (u,v) = (u cos v, u sin v, u) 


Recall from one-variable calculus that coshu = (e" + e™)/2 and sinhu = (e! — e™")/2. We easily verify from these definitions that 


cosh? u — sinh? u = 1. 
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(i) (ii) 


(iii) (iv) 
8. Match the following parametrizations to the surfaces 
shown in the figures. 
(a) (u, v) = (u cos v, u sin v, 4 — u cos v — u sin v); 
u € [0, 1], v e [0, 27] 
(b) ®(u, v) = (u cos v, u sin v, 4 — u?) 
(c) (u, v) = (u, v, $(12 — Bu — 3v)) 


(d) @(u, v) = ((u2 + 6u + 11) cos v, 
u, (u2 + 6u + 11) sin v) 


z g 
A 


(ii) 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


Find an expression for a unit vector normal to the surface 


x = cosvsinu, y =sinvsinu, Z = COSU 


at the image of a point (u, v) for u in [0, x] and vin 
[0, 2x]. Identify this surface. 


Repeat Exercise 9 for the surface 


x = 3cos@sing, y =2sinð sine, Z=COS 
for @ in [0, 2x] and ¢ in [0, x]. 
Repeat Exercise 9 for the surface 
X =sinu, y=u, Z = COSV 
for 0 < v < 2x and —1 <u < 3. 
Repeat Exercise 9 for the surface 
x =(2—coSv)cosu, y=(2—cosv)sinu, z=sinv 


for =x <u <2,—m < v < m. ls this surface regular? 


(a) Develop a formula for the plane tangent to the 
surface x = h(y, 2). 


(b) Obtain a similar formula for y = k(x, z). 


Find the equation of the plane tangent to the surface 
X =u2,y =v2,z = U? + v? atthe pointu = 1, v = 1. 


Find a parametrization of the surface z = 3x? + 8xy and 
use it to find the tangent plane at x = 1, y = 0, z = 3. 
Compare your answer with that using graphs. 


Find a parametrization of the surface 

x3 + 3xy +z? =2, z > 0, and use it to find the tangent 
plane at the point x =1, y = 1/3, z = 0. Compare your 
answer with that using level sets. 


Consider the surface in R? parametrized by 


&æ(r, 0) = (r coso, r sind, 8), 
and 0 <09 < 4r. 


0<r<1 


(a) Sketch and describe the surface. 
(b) Find an expression for a unit normal to the surface. 


(c) Find an equation for the plane tangent to the surface 
at the point (Xo, Yo, Zo). 

(d) If (Xo, Yo, Zo) is a point on the surface, show that the 
horizontal line segment of unit length from the z 
axis through (Xo, Yo, Zo) is contained in the surface 
and in the plane tangent to the surface at 
(xo, Yo, Zo). 


18. Given a sphere of radius 2 centered at the origin, find the 


equation for the plane that is tangent to it at the point 
(1, 1, /2) by considering the sphere as: 


(a) a surface parametrized by 
(0, $) = (2 cos sin o, 2sin 8 sin ġ, 2 coso); 
(b) a level surface of f(x, y, z) = xX? + y? + z2; and 
(c) the graph of g(x, y) = 4/4 — x? — y2. 
19. (a) Find a parametrization for the hyperboloid 
x2 +y? —2? =25. 
(b) Find an expression for a unit normal to this surface. 


(c) Find an equation for the plane tangent to the surface 


at (xo, yo, 0), where xf + yg = 25. 

(d) Show thatthe lines (xo, yo, 0) + t(—yo, Xo, 5) and 
(Xo, yo, 0) +t( yo, —Xo, 5) lie in the surface and in 
the tangent plane found in part (c). 


20. A parametrized surface is described by a differentiable 
function ®: R? — R?. According to Chapter 2, the 
derivative should give a linear approximation that yields 
a representation of the tangent plane. This exercise 
demonstrates that this is indeed the case. 


(a) Assuming Ty x T, + 0, show that the range of the 
linear transformation D®(uo, vo) is the plane 
spanned by Ty andT,.[HereTy and T, are 
evaluated at (Uo, vo).] 


Show that w L (Ty x T,) if and only if w is in the 
range of D®(ug, vo). 


Show that the tangent plane as defined in this 
section is the same as the “parametrized plane” 


(b 


(c 


(u, v) > ®(Up, vo) +D (uo, vo) k - pl 
v — vo 


21. Consider the surfaces ®1(u, v) = (u, v, 0) and 
(u, v) = (u3, v3, 0). 


(a) Show that the image of © and of ®2 is the xy plane. 


(b) Show that ® describes a regular surface, yet 2 
does not. Conclude that the notion of regularity of a 
surface S depends on the existence of at least one 
regular parametrization for S. 


(c) Prove that the tangent plane of S is well defined 
independently of the regular (one-to-one) 
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parametrization (you will need to use the inverse 
function theorem from Section 3.5). 


(d) After these remarks, do you think you can find a 
regular parametrization of the cone of Figure 7.3.7? 


22. The image of the parametrization 
(u,v) = (x(u, v), y(u, v), z(u, v)) 
= (a sinu cos v, b sinu sin v, c cosu) 


withb <a,0 <u <x,0 <v < 2m parametrizes an 
ellipsoid. 


(a) Show that all points in the image of © satisfy: 


2 2 2 
X y Z 
apg 
(the Cartesian equation of an ellipsoid). 


(b) Show that the image surface is regular at all points. 
23. The image of the parametrization 


(u, v) = (x(u, v), y(u, v), z(u, v)) 
= ((R +r cosu) cosv, (R +r cosu) sin v,r sinu) 


with 0 <u, v <2x,0 <r < 1 parametrizes a torus (or 
doughnut) S. 


(a) Show that all points in the image (x, y, z) satisfy: 


(\/x2 + y2 — RP +z? =r?. 


(b) Show that the image surface is regular at all points. 


24. Let ® bea regular surface at (uo, vo); that is, ® is of 
class C t and Ty xT, 40 at (Uo, vo). 


(a) Use the implicit function theorem (Section 3.5) to 
show that the image of ® near (Ug, vo) is the graph 
of aC ! function of two variables. If the z 
component of Ty x T, is nonzero, we can write it 
asz = f(x,y). 

Show that the tangent plane at ®(uo, vo) defined by 
the plane spanned by Ty and T, coincides with the 
tangent plane of the graph of z = f(x, y) at this 
point. 


(c 


Before proceeding to general surface integrals, let us first consider the problem of 
computing the area of a surface, just as we considered the problem of finding the arc 
length of a curve before discussing path integrals. 
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Definition of Surface Area 


In Section 7.3 we defined a parametrized surface S to be the image of a function 
®: D c R? = R?, written as ®(u, v) = (x(u, v), y(u, v), z(u, v)). The map ® was 
called the parametrization of S and S was said to be regular at ®(u, v) € S provided 
thatT, x T, 4 0, where 


OX oy . OZ 
T, 5y tu N+ zg D + 2 Cu, wk 
and 
OX oy . oZ 
T= on —(U, v)i i+ tu, oj + Cu, vk. 


Recall that a regular surface (loosely speaking) is one that has no corners or breaks. 
Inthe rest of this chapter and in the next one, we shall consider only piecewise regular 
surfaces that are unions of images of parametrized surfaces ®;: D; — IR? for which: 
(i) Dj is an elementary region in the plane; 
(ii) ©; is of class C 1 and one-to-one, except possibly on the boundary of D;; and 
(iii) Si, the image of ®;, is regular, except possibly at a finite number of points. 


Definition Area of a Parametrized Surface Wedefinethesurfacearea’® 
A(S) of a parametrized surface by 


as) = [f Ty xT, | du do, (1) 


where ||T, x T,|| isthe norm of Tu x T,. If S is a union of surfaces S;, its area is 
the sum of the areas of the S;. 


As you can easily verify, we have 


Say]? faya]? fax, 2) 7? 
mma xT. = [2] ae +e]. (2) 


where 


and so on. Thus, formula (1) becomes 
p= 2 falx, z)]? 
ais)= ff [2 + [Beat] +(e du dv. (3) 


10A s we have not yet discussed the independence of parametrization, it may seem that A(S) depends 
on the parametrization ®. We shall discuss independence of parametrization in Section 7.6; the use of 
this notation here should not cause confusion. 
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(xy Yip Zij) 


x 


figure 7.4.1 ||T,, x T, || A uAv is equal to the area of a parallelogram that approximates the 
area of a patch on a surface S= (D). 


Justification of the Area Formula 


We can justify the definition of surface area by analyzing the integral 
Sfp Tu x Ty] du dv in terms of Riemann sums. For simplicity, suppose D is a rect- 
angle; consider the nth regular partition of D, and let Ri be the i jth rectangle in the 
partition, with vertices (uj, vj), (Wisi, vj), (Ui, vj41), and (Uig Vj41),0 <i <n—1, 
0 < j <n — 1. Denote the values of Tu and T, at (ui, vj) by Ty, and T,,. We can think 
of the vectors AuTy, and AvT,, as tangent to the surface at ®(uj;, vj) = (Xij, Yij Zi), 
where Au = Uj41 — Uj, Av = vj 41 — vj. Then these vectors form a parallelogram Pj 
that lies in the plane tangent to the surface at (Xij, Yij, zj) (see Figure 7.4.1). We thus 
have a “patchwork cover” of the surface by the Pij. For n large, the area of Pij is a good 
approximation to the area of ®( Rj). Because the area of the parallelogram spanned by 
two vectors vı and vp is ||vi x v2|| (see Chapter 1), we see that 


A(Pij) = AUT., x AvT,, || = Ty, x Ty, || Au Av. 


Therefore, the area of the patchwork cover is 


n—-1 n-1 n—1 n-1 


An = A(Pi) = Ty, x Ty, || Au Av. 


ll 
So 
Il 
So 
ll 
So 
ll 
So 


Asn — ow, the sums A, converge to the integral 


n IT, x T,|| du dv. 
D 


Because A, should approximate the surface area better and better as n — oo, we are 
led to formula (1) as a reasonable definition of A(S). 


Let D be the region determined by 0 < 6 < 2x,0 <r < 1 and let the function 
©: D — R?, defined by 
X =r cosé, y =r sin9, Z=!, 


be a parametrization of a cone S (see Figure 7.3.8). Find its surface area. 
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In formula (3), 


A(X, y) ae -=r a = 
a(r,6) {sine rcose| 
a(y,z)  jsinð r coso 

= =en 6, 
wa lai o Ae 

and 

ə(x,z) jcosð -—rsine , 

= =r sinð, 
a(r, 8) | 1 0 | 


so the area integrand is 


IT, x Toll = Vr? +1r2cos26 + r2sin26 =r v2. 


Clearly, |T; x Tal] vanishes forr = 0, but (0, 6) = (0, 0, 0) for any @. Thus, (0, 0, 0) 
is the only point where the surface is not regular. We have 


2x 1 2a 
JI IT, x Tolldr do = | i Vir dr do = | 5V2d0 = v2n. 
D 0 0 0 


To confirm that this is the area of ®(D), we must verify that ® is one-to-one (for 
points not on the boundary of D). Let D° be the set of (r, 0) withO <r < 1 and 
0 < 6 < 2m. Hence, D° is D without its boundary. To see that 6: D° —> R? is 
one-to-one, assume that (r, 6) = ®(r’, 6’) for (r, @) and (r’, 6’) € D°. Then 


r cos =r’cosé’, rsin =r’sing’, r=r. 


From these equations it follows that cos@ = cos@’ and sing = sin 6’. Thus, either 
0 = 0' or 8 = 0' + 2xn. But the second case is impossible for n a nonzero integer, 
because both @ and @’ belong to the open interval (0, 27) and thus cannot be more than 
2x radians apart. This proves that off the boundary, ® is one-to-one. (Is ®: D —> R? 
one-to-one?) In future examples, we shall not usually verify that the parametrization is 
one-to-one when itis intuitively clear. A 


A helicoid is defined by ®: D — R?, where 
X =T cosé, y =r sin9, Z=0 


and D is the region where 0 < 6 < 2x and 0 <r < 1 (Figure 7.4.2). Find its area. 


figure 7.4.2 The helicoid x = rcosé, y = r sin0, Z = 8. 
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We compute 0(x, y)/a(r, 6) =r asin Example 1, and 


a(y,z) _ |sin@ rcosé 


= = sind, 
alr, 8) 1 | 
a(x,Z)  |cos@ —rsine 

= = cosd. 
a(r, 8) 0 1 | 


The area integrand is therefore V/r2 + 1, which never vanishes, so the surface is regular. 
The area of the helicoid is 


27 1 1 
J) IIT, xToldrdo = f f VI+ Idr do = 2x f yvr? + 1dr. 
D 0 0 0 


After a little computation (using the table of integrals), we find that this integral is 
equal to 


z[V2 + log(1+v2)]. à 


Surface Area of a Graph 


A surface S given in the form z=g(x, y), where (x, y)e D, admits the parametri- 
zation 


X =U, y=v, Zz = g(u, v) 


for (u, v) e D. When g is of class C +, this parametrization is smooth, and the formula 
for surface area reduces to 


OOR 


after applying the formulas 


. og ag 
T,=i+—k, T,=j+—k, 
Ty Izy 
and 
_ 99; _ 99, 39; _ 29 
T,xT, = Th zI tK ag ay +k: 


as noted in Example 4 of Section 7.3. 


Surfaces of Revolution 


In most books on one-variable calculus, itis shown that the lateral surface area generated 
by revolving the graph of afunction y = f(x) about the x axisis given by 


b 
kate (1 f (xX), /T FTP OOP) ax. (5) 
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If the graph is revolved about the y axis, the surface area is 


b 
A= 2x | (xyT#TP OOP) dx (6) 


We shall derive formula (5) by using the methods just developed; we can obtain formula 
(6) in asimilar fashion (Exercise 10). 

To derive formula (5) from formula (3), we must give a parametrization of S. Define 
the parametrization by 


X =U, y = f(u) cos», z= f(u)sinv 
over the region D given by 
a<u<b, O<v<2z. 
This is indeed a parametrization of S, because for fixed u, the point 
(u, f(u) cosy, f(u) sin v) 


traces out a circle of radius | f (u)| with the center (u, 0, 0) (Figure 7.4.3). 


We calculate 
a(x, y) = : a(y, zZ) 7 , a(x, z) 7 
a(u, v] = —f(u)sinv, oat = f (u) f(u), TEE f (u) cos v, 


and so by formula (3) 


B a(x,y)]? paoa faoz]? 
Ma =J) yin] al + Ea ai 
=I VITU sin o + [FU u] 4 [fucos v du dv 
D 


= ff itin L+ (ue dude 
D 
b 2n 
a. IfI +1 Pu) dv du 


b 
=2n | tu) T+ (udu 


which is formula (5). 


at 


figure 7.4.3 The curve y= f(x) rotated 
about the x axis. 


Circumference = 2n| fi (x)| 


Historical Note 
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If S is the surface of revolution, then 27| f (x)| is the circumference of the vertical 
cross section to S at the point x (Figure 7.4.3). Observe that we can write 


b 
2x | [EON VT TOR Ax = [2a f(d, 


where the expression on the right is the path integral of 27 |f (x)| along the path given 
by c: [a, b] > R?, t + (t, f(t)). Therefore, the lateral surface of a solid of revolution 
is obtained by integrating the cross-sectional circumference along the path that is the 
graph of the given function. 


The most famous mathematician in ancient times was Archimedes. In addition to 
being an extraordinarily gifted mathematician, he was also an engineering 
genius on a scale never before seen and was greatly admired by his 
contemporaries and by later writers for his insights into mechanics. It was these 
talents that helped the people of the city of Syracuse in 214 B.c. to defend their 
city against the onslaught of the Roman legions under their commander 
Marcellus. 

When the Romans besieged the city, they encountered an enemy whom 
Archimedes had supplied—totally unexpectedly—with powerful weapons, 
including artillery and burning mirrors, which, as legend has it, incinerated the 
Roman fleet. 

The siege of Syracuse lasted two years, and the city finally fell as a result of 
acts of treason. In the aftermath of the assault, the old scientist was slain by a 
Roman soldier, even though the commander had asked his men to spare 
Archimedes' life. As the story goes, Archimedes was sitting in front of his house 
studying some geometric figures he had drawn in the sand. When a Roman 
soldier approached, Archimedes shouted out, “Don’t disturb my figures!” The 
ruffian, feeling insulted, slew Archimedes. 

To honor this great man, Marcellus erected a tomb for Archimedes on which, 
according to Archimedes’ own wishes, were depicted a cone, a sphere, and a 
cylinder (Figure 7.4.4). 

Archimedes was incredibly proud of his calculation of the volume and surface 
area of the sphere, which justifiably were seen as truly outstanding 
accomplishments for their time. As in his works on centers of gravity, for which he 
provided no clear definition, Archimedes was able to compute the surface area 
of the sphere without having a clear definition of precisely what it was. However, 
as with many mathematical works, one knows the answer long before a proof or 
even the correct definition can be found. 

The problem of properly defining surface areas is a difficult one. To 
Archimedes’ credit, a careful theory of surface areas was not achieved until the 


figure 7.4.4 Archimedes’ theorem: The ratios of the volumes of a cone, a 
half ball, and a cylinder, all of the same height and radius, are 1:2:3. 
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twentieth century, after a long development that began in the seventeenth 
century with the discovery of calculus. 

Christiaan Huygens (1629-1695) was the first person since Archimedes to give 
results on the areas of special surfaces beyond the sphere, and he obtained the 
areas of portions of surfaces of revolution, such as the paraboloid and 
hyperboloid. 

The brilliant and prolific mathematician Leonhard Euler (1707-1783) presented 
the first fundamental work on the theory of surfaces in 1760 with Recherches sur la 
courbure des surfaces. However, it was in 1728, in a paper on shortest paths on 
surfaces, that Euler defined a surface as a graph z= f(x,y). Euler was interested 
in studying the curvature of surfaces, and in 1771 he introduced the notion of the 
parametric surfaces that are described in this section. 

After the rapid development of calculus in the early eighteenth century, 
formulas for the lengths of curves and areas of surfaces were developed. 
Although we do not know when all the area formulas presented in this section first 
appeared, they were certainly common by the end of the eighteenth century. 
The underlying concepts of the length of a curve and the area of a surface were 
understood intuitively before this time, and the use of formulas from calculus to 
compute areas was considered a great achievement. 

Augustin-Louis Cauchy (1789-1857) was the first to take the step of defining the 
quantities of length and surface areas by integrals as we have done in this book. 
The question of defining surface area independent of integrals was taken up 
somewhat later, but this posed many difficult problems that were not properly 
resolved until this century. 

The Spheres of Archimedes can be seen throughout nature, from the shapes of 
stars and planets to that of soap bubbles. Figure 7.4.5 shows a boy blowing a 
soap bubble. 

Bubble blowing is an old pastime. There is even an Etruscan Vase in the Louvre 
on which children are portrayed blowing bubbles. Have you ever wondered why 
soap bubbles are round and not cubical? Analogously, why are the planets and 
sun round? What really determines shape and form in our universe? 


figure 7.4.5 A boy blowing a soap bubble. Painting by 
the French artist Jean Baptiste Siméon Chardin 
(1699-1779), The Metropolitan Museum of Art, New York. 
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Answers to these questions involve central concepts in our book, namely, 
problems of maxima and minima, and in the case of soap bubbles problems of 
area and volume. Soap bubbles are round because nature is economical. The 
spherical shape is the shape of the unique surface of least possible area 
containing a fixed volume (which in the case of a bubble is air). 

Mathematical problems of this kind belong to a subject called The Calculus of 
Variations, a subject almost as old as The Calculus itself. 

For more information, consult our Internet supplement or the book The 
Parsimonious Universe by Hildebrandt and Tromba (Springer-Verlag, 1995). 


exercises 


1 


2. 


3. 


Find the surface area of the unit sphere S represented 
parametrically by ®: D —> S c R?, where D is the 
rectangle 0 < 0 < 2x, 0 < @ < x and ® is given by the 
equations 

x = coso sino, y =sinésing, Z=COS¢. 
Note that we can represent the entire sphere 
parametrically, but we cannot represent it in the form 
z= f(x,y). 


In Exercise 1, what happens if we allow ¢ to vary from 
—z/2 to 1/2? From 0 to 2x? Why do we obtain 
different answers? 


Find the area of the helicoid in Example 2 if the domain 
DisO<r <land0 <6 < 3x. 


4. The torus T can be represented parametrically by the 


function ®: D — R3, where @ is given by the 
coordinate functions x = (R + cos) cosé, 

y = (R + coso) sing, z =sing; D is the rectangle 

[0, 27] x [0, 27], that is, 0 < 0 < 2x, 0 < ¢ < 2x; and 
R > Lis fixed (see Figure 7.4.6). Show that 

A(T) =(2z)2R, first by using formula (3) and then by 
using formula (6). 


figure 7.4.6 A cross section of a torus. 


10. 


11. 


12. 


13. 


Let &(u, v) = (e" cos v, e" sin v, v) be a mapping from 
D =[0, 1] x [0, x] in the uv plane onto a surface S in 
xyz space. 


(a) Find Tu x Ty. 


(b) Find the equation for the tangent plane to S when 
(u, v) = (0, 5). 
(c) Find the area of &(D). 


Find the area of the surface defined by z = xy and 
x? +y? <2. 


Use a surface integral to find the area of the triangle T in 
R3 with vertices at (1, 1, 0), (2, 1, 2), and (2, 3, 3). 
Verify your answer by finding the lengths of the sides 
and using classical geometry. 

[HINT: Write the triangle as the graph z = g(x, y) over a 
triangle T * in the xy plane.] 


U se a surface integral to find the area of the quadrilateral 
D in R3 with vertices at (—1, 1, 2), (1, 1, 2), (0, 3, 5), 
and (5, 3, 5). Verify your answer by finding the lengths 
of the sides and using classical geometry. [HINT: See the 
hint in the previous problem. ] 


Let @(u, v) = (u — v, u + v, Uv) and let D be the unit 
disc in the uv plane. Find the area of ®(D). 


Find the area of the portion of the unit sphere that is cut 
out by the cone z > ,/x2 + y? (see Exercise 1). 


Show that the surface x = 1/,/y2 + z2, where 
1 <x < co, can be filled but not painted! 


Find a parametrization of the surface x? — y? = 1, 
where x > 0, —1 < y < land0 < z < 1. Use your 
answer to express the area of the surface as an integral. 


Represent the ellipsoid E : 


2 2 2 
X yo z 
ogee 
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14, 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 
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parametrically and write out the integral for its surface 
area A(E ). (Do not evaluate the integral.) 


Let the curve y = f(x),a < x < b, be rotated about the 
y axis. Show that the area of the surface swept out is 
given by equation (6); that is, 


b 
A =2x | IX|4/1 + [ f (x) ]? dx. 
a 


Interpret the formula geometrically using arc length and 
slant height. 


Find the area of the surface obtained by rotating the 
curve y = x2, 0 < x < 1, about the y axis. 


U se formula (4) to compute the surface area of the cone 
in Example 1. 


Find the area of the surface defined by 
x+y+2=1,x? + 2y? <1. 


Show that for the vectors Ty and T,,, we have the 
formula 


mora- yae + fae * Gea) 


Compute the area of the surface given by 
X =r cosé, y = 2r cos9, Z= 0) 
0<r <1, 0<0 <2nz. 

Sketch. 


Prove Pappus’ theorem: Let c: [a,b] > R? beaC! 
path whose image lies in the right half plane and is a 
simple closed curve. The area of the lateral surface 
generated by rotating the image of ¢ about the y axis is 
equal to 27x!(c), where x is the average value of the x 
coordinates of points on c and I (c) is the length of c. 
(See Exercises 16 to 19, Section 7.1, for a discussion of 
average values.) 


The cylinder x? + y? = x divides the unit sphere S into 
two regions Si and S2, where Sy is inside the cylinder 
and Sz outside. Find the ratio of areas A(S2)/A(S:). 


Suppose a surface S that is the graph of a function 

z = f(x, y), where (x, y) € D c R? can also be 
described as the set of (x, y, z) € R? with 

F (x, y, z) = 0 (a level surface). Derive a formula for 
A(S ) that involves only F . 


Calculate the area of the frustum shown in Figure 7.4.7 
using (a) geometry alone and, second, (b) a surface-area 
formula. 


24. 


25. 


26. 


27. 


>'i 


y=mx+q 


a 


figure 7.4.7 A line segment revolved 
around the y axis becomes a frustum of a 
cone. 


A cylindrical hole of radius 1 is bored through a solid 
ball of radius 2 to form a ring coupler, as shown in 
Figure 7.4.8. Find the volume and outer surface area of 
this coupler. 


figure 7.4.8 Find the outer surface area 
and volume of the shaded region. 


Find the area of the graph of the function f(x, y) = 
3(x3/2 + y3/2) that lies over the domain [0, 1] x [0, 1]. 


Express the surface area of the following graphs over the 
indicated region D as a double integral. Do not evaluate. 
(a) (x +2y)?; D =[-1, 2] x [0, 2] 

(b) xy+x/(y+1);D =[1, 4] x [1, 2] 

(c) xy3eX’Y’; D = unit circle centered at the origin 


(d) y3cos? x; D = triangle with vertices 
(—1, 1), (0, 2), and (1, 1) 


Show that the surface area of the upper hemisphere of 


radius R, z = ,/R2 — x2 — y2, can be computed by 


formula (4), evaluated as an improper integral. 
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7.5 Integrals of Scalar Functions Over Surfaces 


Now we are ready to define the integral of a scalar function f over a surface S. 
This concept is a natural generalization of the area of a surface, which corresponds to 
the integral over S of the scalar function f(x, y,z) = 1. This is quite analogous to 
considering the path integral as a generalization of arc length. In the next section we 
shall deal with the integral of a vector function F over a surface. These concepts will 
play a crucial role in the vector analysis treated in the final chapter. 

Let us start with a surface S parametrized by a mapping ®: D —> S c R?, where D 
is an elementary region, which we write as ®(u, v) = (x(u, v), y(u, v), z(u, v)). 


Definition The Integral of a Scalar Function Over a Surface 
If f (x, y, z) isa real-valued continuous function defined ona parametrized surface 
S, we define the integral of f over S to be 


[ffooyads= ff tas= ff tou, wits x tuidudy. 0 


Written out, equation (1) becomes 


7 ax, y)? faoz]? az]? 
[fras= ff Falun tuna, [EE] +(e + [Fea dudv. (2) 


figure 7.5.1 ® takes a portion Ry 
of D to a portion of S. 


Thus, if f is identically 1, we recover the area formula (3) of Section 7.4. Like surface 
area, the surface integral is independent of the particular parametrization used. This will 
be discussed in Section 7.6. 

We can gain some intuitive knowledge about this integral by considering it as a limit 
of sums. Let D bea rectangle partitioned into n? rectangles Rj with areas Au Av. Let 
Si; = ®(Rj;) be the portion of the surface (D ) corresponding to Rij (see Figure 7.5.1), 
and let A(Sj) be the area of this portion of the surface. For large n, f will be approxi- 
mately constant on Sij, and we form the sum 


Sp = f (@(uj, vj)) A(Sj), (3) 


u 
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example | 


solution 


where (uj, vj) € Rij. From Section 7.4 we have a formula for A(Sj)): 
= IT, x T,ldu dv, 
Rij 


which, by the mean-value theorem for integrals, equals ||Tu; x T, || Au Av for some 
point (už, vj) in Ry. Hence, our sum becomes 


n-1 n-1 
f(®(uj, vj)) [Tus x Ty ||Au Av, 
i=0 j=0 


which is an approximating sum for the last integral in formula (1). Therefore, 


limits =f tdS. 
n—-oo S 


Note that each term in the sum in formula (3) is the value of f at some point 
(uj, vj) times the area of Sj. Compare this with the Riemann-sum interpretation of 
the path integral in Section 7.1. 

If S is a union of parametrized surfaces S;, i = 1,..., N, that do not intersect 
except possibly along curves defining their boundaries, then the integral of f over S is 


defined by 
N 
[ftds=> ff tas, 
S jaa 7 Y Si 


as we should expect. For example, the integral over the surface of a cube may be 
expressed as the sum of the integrals over the six sides. 


Suppose a helicoid is described as in Example 2, Section 7.4, and let f be given by 


f(x, y, Z) = yX? + y? +1. Find ff, fds. 


Asin Examples 1 and 2 of Section 7.4, 


= sind, 


Also, f(r cosé,r sing, 0) = vr? + 1. Therefore, 


[f fooyards = ff re (r,0))||T, x Toll dr de 


20 2x 
a [ VIV + tarde = | 30 = Sx. 
0 0 0 


3 


Surface Integrals Over Graphs 


Suppose S is the graph of aC! function z = g(x, y). Recall from Section 7.4 that we 
can parametrize S by 


X =U, y=v, Zz=g(u,v), 


example 2 


solution 


example 3 


solution 
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and that in this case 


_ ag 2 ag 2 
Ty x Tol] = ys (3) + (2) 1 

ag ag \° 
[ffeuads= ff tiyay 1+ (3 V + (8) aay. (4) 


Let S be the surface defined by z = x? + y, where D is the region 0 < x < 1, 
—l<y <1. Evaluate ff; x dS. 


sO 


If weletz = g(x, y) = x? + y, formula (4) gives 


2 2 1 1 
Jiris- ffoi (2) wl + (3) dxdy = | | xv 1+ 4x2 + ldxdy 
5 -1 J0 
1 
-if fi 244x a [(2 + 4x?)?2]]} dy 
=Ï 


Wp 
ne 3/2 _ 93/2 
-5 aK 23/2) dy 


(63/2 — 23/2) — /6 — a 
z va( v3- 5) 
3 A 


E 


Evaluate ff z?dS, where S is the unit sphere x? + y? +z? = 1. 


For this problem, itis convenient to use spherical coordinates and to represent the sphere 
parametrically by the equations x = cos@sing, y = sin sin, z = cos, over the 
region D in the @¢@ plane given by the inequalities 0 < ¢ < 7,0 < 0 < 2x. From 


equation (1) we get 
Jj zas = J| osor, x Told dọ. 
S D 


A little computation [use formula (2) of Section 7.4; see Exercise 12] shows that 
To x Toll = sing. 
(Note that for 0 < @ < x, we have sing > 0.) Thus, 


2r 1 
[[eas= | f cos’ osing dodo 
S 0 0 


1 2x à 2 2x 4r 
= — — z = — ĝ = —. 
ai [—cos’*]j d 3 |, d 5 r 


This example also shows that on a sphere of radius R, 


27 m 
[ft f flo, 0)R?sing do d9, 
S 0 0 
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or, for short, the area element on the sphere is given by 


dS =R? sing dodo. 


Integrals Over Graphs 


We now develop another formula for surface integrals when the surface can be repre- 
sented as a graph. To do so, we let S be the graph of z = g(x, y) and consider formula 
(4). We claim that 


i f(x,y,z) as= ff (X, Tey ghey) EGY GUCM ody (5) 


where @ is the angle the normal to the surface makes with the unit vector k at the point 
(x, y, g(x, y)) (see Figure 7.5.2). Describing the surface by the equation (x, y, z) = 
Z — g(x, y) = 0, anormal vector N is V@; that is, 


== +k (6) 


[see Example 4 of Section 7.3, or recall that the normal to a surface with equation 
g(x, y, z) = constant is given by Vg]. Thus, 


N-k 1 
IN| \/(ag/ax)2 + (ag/ay)? + 1 


Substitution of this formula into equation (4) gives equation (5). Notethatcos@ = n-k, 
where n = N/||N|j is the unit normal. Thus, we can write 


dx dy 
n-k’ 


cosé = 


dS = 


The result is, in fact, obvious geometrically, for if a small rectangle in the xy plane 
has area AA, then the area of the portion above it on the surface is AS = AA/cosé 
(Figure 7.5.2). This intuitive approach can help us to remember formula (5) and to apply 
itin problems. 


zZ 


AS cos 0 = AA 


figure 7.5.2 The area of a patch of area 
AS over a patch A Ais AS = A4/cos9, 
where @ is the angle the unit normal n 
makes with k. 


example 4 


solution 


example 5 


solution 
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Compute ff x dS, where S is the triangle with vertices (1, 0, 0), (0, 1, 0), (0, 0, 1) (see 
Figure 7.5.3). 


figure 7.5.3 In computing a specific surface 
integral, we find a formula for the unit normal 
n and computes the angle @ in preparation for 
formula (5). 


This surface is the plane described by the equation x + y +z = 1. Because the surface 
isa plane, the angle 6 is constant and a unit be vector isn = (1/./3, 1/73, 1/73). 
Thus, cos@ = n- k = 1/,/3, and by equation (5 


ase) aa 


where D is the domain in the xy plane. But 


V3 | f xayf [navn = V3 f a-t. - 


Integrals of functions over surfaces are useful for computing the mass of a surface 
when the mass density function m is known. The total mass of a surface with mass 
density (per unit area) m is given by 


= ff mex, y, z) dS. (7) 


Let #: D — R? be the parametrization of the helicoid S = ®(D) of Example 2 
of Section 7.4. Recall that ®(r, 0) = (rcosé,rsiné@, 6), where 0 < 6 < 2x, and 
0 <r <1. Suppose S has a mass density at (x, y, z) € S equal to twice the distance of 
(x, y, z) from the central axis (see Figure 7.4.2), thatis, m(x, y, z) = 2,/x2 + y? =2r, 
in the cylindrical coordinate system. Find the total mass of the surface. 


Applying formula (7), 


= [[2veryas = |f 2 ds = | 2r IT, x Toli dr dé. 
S D D 


From Example 2 of Section 7.4, we see that ||T; x T,|| = v1 +r?. Thus, 


2x 1 
= a Vis rdr de = | J avivrarae 
D 0 0 


alt ! an 4r 
= “(aera a= | — (27? =1) de = — (27 — 1). 
fo Garer] -ndas E-n 
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exercises 


1 


Evaluate the integral of the function f(x, y,z) =X +y 
over the surface S given by: 


(u, v) =(2ucosv, 2usinv, u), u €[0, 4], v € [0, x] 


» Evaluate the integral of the function f(x, y,z) =z+6 


over the surface S given by: 


blu, v) = (u, £, v), 


3 u e€ [0, 2], v € [0, 3]. 


. Evaluate the integral 


f|- +aas, 
S 


where S is the portion of the plane 2x + 3y + z = 6 that 
lies in the first octant. 


. Evaluate the integral 


[ferns 
S 


where S is the part of the cylinder y? + z? = 4 with 
x e [0,5]. 


» Let S be the surface defined by 


(u, v) = (u + v, U — v, UD). 


(a) Show that the image of S is in the graph of the 
surface 4z = x? — y?. 

(b) Evaluate ffs x dS for all points on the graph S, over 
x+y? <1. 


» Evaluate the integral 


[fv + yz) dS, 
S 


where S is the part of the plane z = 4 + x + y that lies 
inside the cylinder x2 + y? = 4. 


. Compute ff; xy dS, where S is the surface of the 


tetrahedron with sides z = 0, y = 0, x +z = 1, and 
x=y. 


. Evaluate ff; xyzdS, where S is the triangle with 


vertices (1, 0, 0), (0, 2, 0), and (0, 1, 1). 


. Evaluate ff; z dS, where S is the upper hemisphere of 


radius a, that is, the set of (x, y, z) with 
z= ya? x? — y?, 


10. 


11. 


12. 


13. 


14, 


15. 


16. 


17. 


18. 


Evaluate ffs (x + y +z) dS, where S is the boundary of 
the unit ball B; that is, S is the set of (x, y, z) with 

x? + y? + z? = 1. (HINT: Use the symmetry of the 
problem.) 


(a) Compute the area of the portion of the cone 
x2 + y? = z? with z > 0 that is inside the sphere 
x? + y? +z? = 2Rz, where R is a positive constant. 
(b) What is the area of that portion of the sphere that is 
inside the cone? 


Verify that in spherical coordinates, on a sphere of 
radius R, 


Ty x Tolldødo = R? sing do dð. 


Evaluate Ms zdS, where S is the surface 
z=x?+y4,x24y2 <1, 


Evaluate the surface integral ffo z? dS, where S is the 
boundary of the cube C = [—1, 1] x [-1, 1] x [-1, 1]. 
(HiNT: Do each face separately and add the results.) 


Find the mass of a spherical surface S of radius R such 
that at each point (x, y, z) € S the mass density is equal 
to the distance of (x, y, z) to some fixed point 

(Xo, Yo, Zo) € S. 


A metallic surface S is in the shape of a hemisphere 

Z= \/R2 — x? — y2, where (x, y) satisfies 

0 < x? +y? < R2. The mass density at (x, y, z) € S is 
given by m(x, y, z) = x2 + y2. Find the total mass of S. 


Let S be the sphere of radius R. 
(a) Argue by symmetry that 


[[eas= [f Pas= ffas. 


(b) Use this fact and some clever thinking to evaluate, 
with very little computation, the integral 


[le 


(c) Does this help in Exercise 16? 


(a) Use Riemann sums to justify the formula 


I 
aay | f e928 


for the average value of f over the surface S. 


19. 


20. 


21. 


22. 


23. 


(b) In Example 3 of this section, show that the average 
of f(x, y, Z) = z? over the sphere is 1/3. 

(c) Define the center of gravity (x, y, Z) of a surface S 
to be such that x, y, and Ž are the average values of 
the x, y, and z coordinates on S. Show that the 
center of gravity of the triangle in Example 4 of this 
section is (}, $, 4). 


Find the average value of f(x, y,z) =x + z2 on the 
truncated cone z? = x? + y?, with 1 < z < 4. 


Evaluate the integral 


f[u-aas, 
S 


where S is the graph of z = 1 — x? — y2, with 
x+y? <1. 
Find the x, y, and z coordinates of the center of gravity 


of the octant of the solid sphere of radius R and centered 
at the origin determined by x > 0, y > 0, z > 0. (HINT: 
Write this octant as a parametrized surface— see 
Example 3 of this section and Exercise 18.) 


Find the z coordinate of the center of gravity (the average 
z coordinate) of the surface of a hemisphere (z < 0) 
with radius r (see Exercise 18). Argue by symmetry that 
the average x and y coordinates are both zero. 


Let @: D c R? —> R? bea parametrization of a surface 

S defined by 
x=x(u, v), y=y(u,v), Z=2Z(u,v). 

(a) Let 
a@ (əx dy az hä a@ (əx dy az 
au \ au’ au’ au ðv \ du’ du’ ðv 


thatis, ọ@®/əðu = Ty and d®/dv = Ty, and set 


® |/2 ® |/2 
E= |2 6=|— 
ou dv 


Show that 
VEG =F? = ||Tu x Tyll, 


_ 3d a® 
~ au “au 


24, 


25. 


26. 
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and that the surface area of S is 
ais) = f) VEG —F2dudv. 
D 


In this notation, how can we express f fg f dS for a 
general function of f ? 

(b) What does the formula for A(S) become if the 
vectors d@/du and 0®@/dv are orthogonal? 

(c) Use parts (a) and (b) to compute the surface area of 
a sphere of radius a. 


Dirichlet’s functional for a parametrized surface 
®: D —> R? is defined by! 


J (8) = shh ( (ls I+ 


Use Exercise 23 to argue that the area A(®) < J (®) 
and equality holds if 


a lal =la 


Compare these equations with Exercise 23 and the 
remarks at the end of Section 7.4. A parametrization ® 
that satisfies conditions (a) and (b) is said to be 
conformal. 


28 A) aude, 


o% ® 
and (b) — oe 
ðu ðv 


Let D c R? and ®: D —> R? be a smooth function 


(u, v) = (x(u, v), y(u, v)) satisfying conditions (a) 
and (b) of Exercise 16 and assume that 
ax aX 
det au BY > 0. 
oy. ay 
ou ðv 


Show that x and y satisfy the Cauchy- Riemann 
equations 3x /du = 3y /ðv, ƏX/ðv = —dy/du. 
Conclude that V2 = 0 (i.e., each component of ® is 
harmonic). 


Let S be a sphere of radius r and p bea point inside or 
outside the sphere (but not on it). Show that 
1 dS = 4rr if pisinside S 
sIx=p ` \4rr?/d if pisoutside S, 


11D irichlet's functional played a major role in the mathematics of the nineteenth century. The mathematician Georg Friedrich B ernhard Riemann 
(1826-1866) used it to develop his complex function theory and to give a proof of the famous Riemann mapping theorem. Today it is still used 
extensively as a tool in the study of partial differential equations. 
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where d is the distance from p to the center of the sphere 28. Leta surface S be defined implicitly by F (x, y, z) = 0 
and the integration is over the sphere. [HINT: Assume p for (x, y) in a domain D of R?. Show that 
is on the z-axis. Then change variables and evaluate. 
Why is this assumption on p justified?] 1 E] ds 
0z 
27. Find the surface area of that part of the cylinder > 5 5 5 
x? +z? = a? thatis inside the cylinder x2 + y? = 2ay m I Ë ) + (= ) fe (+ ) dx dy. 
and also in the positive octant (x > 0, y > 0, z > 0). D ax ay az 
Assumea > 0. 


Compare with Exercise 22 of Section 7.4. 


7.6 Surface Integrals of Vector Fields 


The goal of this section is to develop the notion of the integral of a vector field over a 
surface. Recall that the definition of the line integral of a vector field was motivated by 
the fundamental physical notion of work. Similarly, there is a basic physical notion of 
flux that motivates the definition of the surface integral of a vector field. 

For example, if the vector field is the velocity field of a fluid (perhaps the velocity 
field of a flowing river), and we put an imagined mathematical surface into the fluid, 
we can ask: “W hat is the rate at which fluid is crossing the given surface (measured in, 
say, cubic meters per second)?” The answer is given by the surface integral of the fluid 
velocity vector field over the surface. 

We shall come back to the physical interpretation shortly and reconcile it with the 
formal definition that we give first. 


Definition of the Surface Integral 


We now define the integral of a vector field, denoted F, over a surface S. We first 
give the definition and later in this section give its physical interpretation. This can 
also be used as a motivation for the definition if you so desire. Also, we shall start 
with a parametrized surface ® and later study the question of independence of 
parametrization. 


Definition The Surface Integral of Vector Fields Let F be a vector field 
defined on S, the image of a parametrized surface ®. The surface integral of F 


over ®, denoted by 
I F- dS, 
® 
is defined by (see Figure 7.6.1) 


[[Fos= [fem x To) du dv. 


figure 7.6.1 The geometric 
significance of F- (Ty x T,). 


example | 


solution 
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< 


Let D be the rectangle in the @¢ plane defined by 
0<6<02zn, 0<¢<7, 
and let the surface S be defined by the parametrization @: D —> R? given by 
x = coso sing, y =sinésing, Z=CcOS¢. 


(Thus, 6 and ¢ are the angles of spherical coordinates, and S is the unit sphere para- 
metrized by ®.) Let r be the position vector r(x, y,z) = xi + yj + zk. Compute 


Sgt dS. 


First we find 


T, =(—singsin6)i + (singcose)j 
T, = (cosécos¢)i + (sin@ cos ¢)j — (sin ¢)k, 
and hence 


To x Ty = (—sin’¢ cos 6)i — (sin? sin @)j — (sind cos @)k. 


Then we evaluate 


r- (To x Tg) xi + yj + zk) - (To x Tg) 
(coso sing)i+ (sin@ sing)j + (cos ¢)k] 
-(—sin d)[(sind cos @)i + (sing sin@)j + (cos d)k] 


= (—sin¢)(sin? ¢ cos? 0 + sin? ¢ sin? 6 + cos’ o) = —sing. 


2x 
Jfr as=[f -singdede = | (—2)d6 = —4z. A 
® D 0 


Orientation 


An analogy can be drawn between the surface integral ffẹ F -dS and the line integral 
JF -ds. Recall that the line integral is an oriented integral. We needed the notion of 
orientation of a curve to extend the definition of f. F -dsto line integrals fẹ F - ds over 
oriented curves. We extend the definition of [fẹ F -dS to oriented surfaces in a similar 
fashion; that is, given a surface S parametrized by a mapping ®, we want to define 
JJF -dS = ffa F-dS and show that it is independent of the parametrization, except 
possibly for the sign. To accomplish this, we need the notion of orientation of a surface. 


= ( 
=[ 


Thus, 
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figure 7.6.2 The two possible unit 
normals to a surface at a point. 


figure 7.6.4 Ants walking on a 
Möbius strip. 


Definition Oriented Surfaces An oriented surface is a two-sided surface 
with one side specified as the outside or positive side; we call the other side the 
inside or negative side.!? At each point (x, y, z) € S there are two unit normal 
vectors nı and nz, where ny = —n; (see Figure 7.6.2). Each of these two normals 
can be associated with one side of the surface. Thus, to specify a side of a surface 
S, at each point we choose a unitnormal vector n that points away from the positive 
side of S at that point. 


This definition assumes that our surface does have two sides. In fact, this is necessary, 
because there are examples of surfaces with only one side! The first known example 
of such a surface was the M obius strip (named after the German mathematician and 
astronomer A. F. Mobius, who, along with the mathematician J. B. Listing, discovered 
itin 1858). Pictures of such a surface are given in Figures 7.6.3 and 7.6.4. At each point 
of M there are two unit normals, nı and n2. However, nı does not determine a unique 
side of M , and neither does np. To see this intuitively, we can slide nz around the closed 
curve C (Figure 7.6.3). When np returns to a fixed point p on C it will coincide with 
nı, showing that both n; and nz point away from the same side of M and, consequently, 
that M has only one side. 

Figure 7.6.4 is a M Obius strip as drawn by the well-known twentieth-century mathe- 
matician and artist M . C. Escher. It depicts ants crawling along the M obius band. A fter 


M 


figure 7.6.3 The Möbius strip: Slide nz around C 
once; when nz returns to its initial point, it will 
coincide with nı = —n,. 


L.We use the term “side” in an intuitive sense. This concept can be developed rigorously, but this will 
not be done here. A Iso, the choice of the side to be named the “outside” is often dictated by the surface 
itself, as, for example, is the case with a sphere. In other cases, the naming is somewhat arbitrary (see 
the piece of surface depicted in Figure 7.6.2, for instance). 


example 2 
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one trip around the band (without crossing an edge) they end up on the “opposite side” 
of the surface. 

Let : D — R? bea parametrization of an oriented surface S and suppose S is 
regular at ®(Uo, vo), (Uo, vo) € D; thus, the vector (Ty, x Ti) /Il Tuy x Tull is defined. 
If n( ®(Uo, vo)) denotes the unit normal to S at ®(Uo, vo), it follows that 


(Tu, x Tio) /lTu x Tall = =n( ®(U, vo)). 


The parametrization ® is said to be orientation-preserving if we have the + sign; 
thatis, if (Ty x T,)/|Tu x T, || = n(®(u, v)) atall (u, v) e D for which S is smooth at 
®(u, v). In other words, ® is orientation-preserving if the vector Ty x T, points to the 
outside of the surface. If Tu x T, points to theinsideof thesurfaceatall points (u, v) € D 
for which S is regular at ®(u, v), then @ is said to be orientation-reversing. Using the 
preceding notation, this condition corresponds to the choice (Tu x T,)/|Ty x Ty] = 
—n(®(u, v)). 

It follows from this discussion that the Mobius band M cannot be parametrized 
by a single parametrization for which n = Ty x T, 4 0 and n is continuous over 
the whole surface! (if there were such a parameterization, then M would indeed have 
two sides, one determined by n and one determined by —n). The sphere in Example 
1 can be parametrized by a single parametrization, but not by one that is everywhere 
one-to-one— see the discussion at the beginning of Section 7.4. 

Thus, any one-to-one parametrized surface for which Ty x T, never vanishes can 
be considered as an oriented surface with a positive side determined by the direction of 
Ty x Ta 


We can give the unit sphere x? + y? + z? = 1 in R? (Figure 7.6.5) an orientation by 
selecting the unit vector n(x, y, z) = r, where r = xi + yj + zk, which points to the 
outside of the surface. This choice corresponds to our intuitive notion of outside for the 
sphere. 

Now that the sphere S is an oriented surface, consider the parametrization ® of S 
given in Example 1. The cross product of the tangent vectors Tọ and T ,—that is, a 
normal to S is given by 


(—sind)[(cosé sin f)i + (sin@ sind)j + (cosd)k] = —rsing. 


figure 7.6.5 The unit sphere oriented by its 
(0, 1, 0) outward normal n. 


y 


13T here is a single parametrization obtained by cutting a strip of paper, twisting it, and gluing the ends, 
but it produces a discontinuous n on the surface. 
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example 3 


Because —sin < 0 for 0 < ¢ < x, this normal vector points inward from the sphere. 
Thus, the given parametrization ® is orientation-reversing. By swapping the order of 6 
and ¢, we would get an orientation-preserving parametrization. A 


Orientation and the Vector Surface Element 
of a Sphere 


Consider the sphere of radius R, namely, x? + y? + z? = R2. It is standard practice 
to orient the sphere with the outward unit normal. In terms of the position vector 
r = Xİ + yj + zk, the outward unit normal is given by 


The order of spherical coordinates that goes along with this orientation, as is evident 
from Example 2, is given by the order (¢, 6). The computation in Example 2 shows that 
the surface-area element is then given by 


dS =n- (T, x To) døde =rR sing do d8 = nR? sing do dê. 


The Orientation of Graphs 


The next example discusses the orientation conventions for graphs. We shall compute 
the area element on graphs later in this section. 


Let S be a surface described by z = g(x, y). As in equation (6), Section 7.5, there are 
two unit normal vectors to S at (Xo, Yo, 9(Xo, Yo)), namely, +n, where 
ag 


. 0 ‘ 
ay Yo)i — ay Xo Yo)j +k 


2 2 i 
| [Ze wo) + Fae wo) +1 


Wecan orientall such surfaces by taking the positiveside of S to bethe side away from 
which n points (Figure 7.6.6). Thus, the positive side of such a surface is determined 
by the unit normal n with positive k component— that is, it is upward-pointing. If 
we parametrize this surface by ®(u, v) = (u, v, g(u, v)), then ® will be orientation- 
preserving. 


figure 7.6.6 n points away from the outside of the 
surface. 
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Independence of Parameftrization 


We now state without proof a theorem showing that the integral over an oriented surface 
is independent of the parametrization. The proof of this theorem is analogous to that 
of Theorem 1 (Section 7.2); the heart of the proof is again the change of variables 
formula— this time applied to double integrals. 


Theorem 4 Independence of Surface Integrals on Parametrizations 
Let S be an oriented surface and let ®, and ®, be two regular orientation- 
preserving parametrizations, with F a continuous vector field defined on S. Then 


II pedes (H) feels 
®) ®) 


If © is orientation-preserving and ® orientation-reversing, then 


| F-ds=- f F-dS. 
®) ®2 


If f is a real-valued continuous function defined on S, and if ®, and ®> are 


parametrizations of S, then 
l fdS = / fds. 
®) ®) 


Note that if f = 1, we obtain 


as)= ff as= ff as, 


thus showing that area is independent of parametrization. 
We can therefore unambiguously use the notation 


[[F-as= ff Fas 


(or a sum of such integrals, if S is a union of parametrized surfaces that intersect only 
along their boundary curves) where ® is an orientation-preserving parametrization. 
Theorem 4 guarantees that the value of the integral does not depend on the selection 
of ®. 


Relation with Scalar Integrals 


Recall from formula (1) of Section 7.2 that a line integral f. F -ds can be thought of 
as the path integral of the tangential component of F along c (although for the case 
in which c intersects itself, the integral obtained is technically not a path integral). A 
similar situation holds for surface integrals, because we are assuming that the mappings 
® defining the surface S are one-to-one, except perhaps on the boundary of D, which 
can be ignored for the purposes of integration. Thus, in defining integrals over surfaces, 
we assume in this book that the surfaces are nonintersecting. 
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For an oriented smooth surface S and an orientation-preserving parametrization ® of 
S, we can express ff; F -dS as an integral of a real-valued function f over the surface. 
Letn = (T, x T,)/||Ty x T,]|| be the unit normal pointing to the outside of S. Then 


[freas= [f F-as= |f tu xtdude 
= (ee) Te x Tatu de 
= ffe-min, xTuidudv= [f(F-mas = ff tas, 


where f = F -n. We have thus proved the following theorem. 


Theorem 5 ff; F-dS, the surface integral of F over S, is equal to the integral 
of the normal component of F over the surface. In short, 


e [[F-nas. 


The observation in Theorem 5 can often save computational effort, as Example 4 
demonstrates. 


The Physical Interpretation of Surface Integrals 


The geometric and physical significance of the surface integral can be understood by 
expressing it as a limit of Riemann sums. For simplicity, we assume D is a rectangle. 
Fix a parametrization ® of S that preserves orientation and partition the region D into 
n? pieces Di0 <i <n—1,0 < j < n—1. We let Au denote the length of the 
horizontal side of Di; and Av denote the length of the vertical side of Dj. Let (u, v) 
be a point in Dij, and (x, y,Z) = ®(u, v) the corresponding point on the surface. We 
consider the parallelogram with sides Au T, and Av T, lying in the plane tangent to S 
at (x, y, z) and the parallelepiped formed by F, AuT,, and AvT,. The volume of the 
parallelepiped is the absolute value of the triple product 


F-(AuTy x AvT,) =F-(Ty x T,) Au Av. 


Thevector T, x T, isnormal to the surface at (x, y, z) and points away from the outside 
of the surface. Thus, the number F -(Ty x T,) is positive when the parallelepiped lies 
on the outside of the surface (Figure 7.6.7). 

In general, the parallelepiped lies on that side of the surface away from which F 
is pointing. If we think of F as the velocity field of a fluid, F(x, y, z) is pointing in 
the direction in which fluid is moving across the surface near (x, y, z). Moreover, the 
number 


IF -(T, Au x T, Av)| 


measures the amount of fluid that passes through the tangent parallelogram per unittime. 
Because the sign of F - (Au Ty x Av T,) is positive if the vector F is pointing outward at 
(x, y, z) and negativeif F ispointinginward, X>; jF - (Tu xT,) Au Avisan approximate 
measure of the net quantity of fluid to flow outward across the surface per unit time. 


figure 7.6.7 F.T, x T,) > 0 
when the parallelpiped formed 
by AvT,, AuT,, and F lies to the 
“outside” of the surface S. 


example 4 


solution 
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Au 


Av 


figure 7.6.8 When F. (T, x T,) > 0 (left), 
F points outward; when F. (fT, x T) < 0 
(right), F points inward. 


(Remember that “outward” or “inward” depends on our choice of parametrization. 
Figure 7.6.8 illustrates F directed outward and inward, given Tu and T,,.) Hence, the 
integral ffs F -dS is the net quantity of fluid to flow across the surface per unit time, 
that is, the rate of fluid flow. This integral is also called the flux of F across the surface. 

In the case where F represents an electric or a magnetic field, //,F-dS is also 
commonly known as the flux. The reader may be familiar with physical laws (such 
as Faraday’s law) that relate flux of a vector field to a circulation (or current) in a 
bounding loop. This is the historical and physical basis of Stokes’ theorem, which we 
will discuss in Section 8.2. The corresponding principle in fluid mechanics is called 
Kelvin’s circulation theorem. 

Surface integrals also apply to the study of heat flow. LetT (x, y, z) bethetemperature 
at a point (x, y,z) € W c R?, where W is some region and T isa C t function. Then 


represents the temperature gradient, and heat “flows” with the vector field —k VT =F, 
where k is a positive constant (see Section 8.5). Therefore, ffs F -dS is the total rate of 
heat flow or flux across the surface S. 


Suppose a temperature function is given in R? by theformulaT (x, y, z) = X? +y? +z, 
and let S be the unit sphere x? + y? + z? = 1 oriented with the outward normal (see 
Example 2). Find the heat flux across the surface S if k = 1. 


We have 


F = —VT (x, y, zZ) = —2xi — 2yj — 2zk. 
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example 5 


On S, the vector n(x, y, z) = xi+ yj +zk is the unit “outward” normal to S at (x, y, z), 
and f(x, y, z) = F -n = —2x? — 2y? — 2z? = —2 is the normal component of F. From 
Theorem 5 we can see that the surface integral of F is equal to the integral of its normal 
component f = F -nover S. Thus, 


f| F-as= ff tas =—2 ff as = -2415) = 2142) = -87 


The flux of heat is directed toward the center of the sphere (why toward?). Clearly, our 
observation that ff F- dS = ffs f dS has saved us considerable computational time. 

In this example, F(x, y, z) = —2xi—2yj —2zk could also represent an electric field, 
in which case ff; F - dS = —8z would be the electric flux across S. A 


Gauss’ Law There is an important physical law, due to the great mathematician and 
physicist K. F. Gauss, that relates the flux of an electric field E over a “closed” surface 
S (e.g., a sphere or an ellipsoid) to the net charge Q enclosed by the surface, namely (in 


suitable units), 
J|: dS=0 (1) 


(see Figure 7.6.9). Gauss’ law will be discussed in detail in Chapter 8. This law is 
analogous to A mpère’s law (see Example 12, Section 7.2). 

Suppose that E = E n; that is, E is a constant scalar multiple of the unit normal to 
S. Then Gauss’ law, equation (1) in Example 5, becomes 


[fe-ds= f[eas=e ffas =a 


because E = E - n. Thus, 


E = HT (2) 


S = closed surface 


figure 7.6.9 Gauss’ law: JI E-dS=Q 
E=electric field where Q is the net charge inside S. 


In the case where S is the sphere of radius R, equation (2) becomes 


Q 


ae 


(3) 
(see Figure 7.6.10). 

Now suppose that E arises from an isolated point charge, Q. From symmetry it is 
reasonable that E = En, where nis the unit normal to any sphere centered at Q . Hence, 
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Ya 


figure 7.6.10 The field E due to a point charge Q is 
E = Qn/4rR?. 


N 


equation (3) holds. Consider a second point charge, Qo, located at a distance R from 
Q.The force F that acts on this second charge, Q o, is given by 


QQo 


F=EQ,=EQon= FaR?" 


If F is the magnitude of F, we have 


_ 00 
~ AgR2' 


which is Coulomb's law for the force between two point charges.4 A 


Surface Integrals Over Graphs 


Finally, let us derive the surface-integral formulas for vector fields F over surfaces S 
that are graphs of functions. Consider the surface S described by z = g(x, y), where 
(x, y) e D, where S is oriented with the upward-pointing unit normal: 


ag\2 /ag\? 
y (ae) + Gy) + 
We have seen that we can parametrize S by ®: D —> R? given by (x, y) = 


(x, y, g(x, y)). In this case, [f F -dS can be written in a particularly simple form. 
We have 


= ag 
T, =i+ ax 
Thus, Tx x Ty = —(09/dx)i — (ag/ay)j +k. If F = Fii+ F2j + F3k is a continuous 
vector field, then we get 


4S ometimes one sees the formula F = (1/4zre9)Q Qo/R2. The extra constant eo appears when M K S 
units are used for measuring charge. We are using CGS, or Gaussian, units. 
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The Surface Integral of a Vector Field Over a Graph S$ 


[fr eas= ff FT, x ty) axay 
= [fi (—32) +r(-2) + Faxo. 


example 6 | The equations 
z=12, x?+y?<25 
describe a disc of radius 5 lying in the plane z = 12. Suppose r is the vector field 
r(x, y,Z) =xit+ yj + Zk. 
Compute ff; r-dS. 


solution | We shall do this in three ways. First, we have 0z/dx = 0z/dy = 0, because z = 12 is 
constant on the disc, so 


r(x, y,z)-(Ty x Ty) =r(x,y,z)- (ix j) =r(x, y, Zz) k =z. 


Using the original definition at the beginning of this section, the integral becomes 


Jfras= ff zaxay = ff 12 dx dy = 12(area of D) = 300z. 
5 D D 


A second solution: Because the disc is parallel to the xy plane, the outward unit normal 
is k. Hence, n(x, y, z) = k andr-n = z. However, ||T, x Ty|| = |Ik] = 1, and so we 
know from the discussion preceding Theorem 5 that 


[fr-as= ff rnos= [[zas= ff 12 dx dy = 3007. 


Third, we may solve this problem by using formula (4) directly, with g(x, y) = 12 and 
D the disc x? + y? < 25: 


[fr-68= [ [0-04 y-0-+ 12) dxdy = 12{area of D) = 300z. A 
S D 


Summary: Formulas for Surface Integrals 
1. Parametrized Surface: (u, v) 


a. Integral of a scalar function f: 


[frs= fff f(®(u, v))ITy x Todu de 


b. Scalar surface element: 


dS = |T, x T,||dudv 
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C. Integral of a vector field F: 


f| F-as= [fete xtduds 
S D 


d. Vector surface element: 
dS = (T, xT,)dudv=ndsS 


2. Graph: z = g(x, y) 
a. Integral of a scalar function f : 


(x, yY, g(x, y)) 
[ffas= [fo eT axty 
b. Scalar surface element: 
_dxdy — //əg\? (ag\? 
0 - (2) + (2) +1004 


where cos 6 = n-k, and nis a unit normal vector to the surface. 
E Integral of a vector field F: 


Ji ds = If (- Figo FASE 4 a) dxdy 


d. Vector surface element: 


3. Sphere: x? + y? +z? = R? 
a. Scalar surface element: 


dS = R? sing do d0 
b. Vector surface element: 


dS = (xi + yj +zk)R sin do d0 = rR sing dọ d0 = nR? sing do d0 


exercises 
1. Consider the closed surface S consisting of the graph 2. Evaluate the surface integral 
z = 1 — x? — y? with z > 0, and also the unit disc in the 
xy plane. Give this surface an outer normal. Compute: If Feds 


F- dS 

JI where F(x, y, z) = xi + yj + zk and S is the surface 
parameterized by ®(u, v) = (2 sinu, 3cosu, v), with 

where F(x, y, z) = (2x, 2y, Z). 0<u<2rand0<v<1. 
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3. Let F(x, y, z) = (x, y, z). Evaluate 
/ I F. dS, 
S 


(a) the upper hemisphere of radius 3, centered at the 
origin. 
(b) the entire sphere of radius 3, centered at the origin. 


where S is: 


» Let F(x, y, z) = 2xi — 2yj + 22k. Evaluate 


[LF 8s 


where S is the cylinder x2 + y? = 4 with z € [0, 1]. 


. Let the temperature of a point in IR? be given by 
T(x, y, Z) = 3x2 + 3z2. Compute the heat flux across 
the surface x? +z? =2,0<y <2,ifk=1. 


» Compute the heat flux across the unit sphere S if 
T(x, y, Z) = x. Can you interpret your answer 
physically? 


. Let S be the closed surface that consists of the 
hemisphere x2 + y? + z? = 1, z > 0, and its base 
x2 +y? < 1,z = 0. Let E be the electric field defined 
by E(x, y, z) = 2xi + 2yj + 2zk. Find the electric flux 
across S. (HINT: Break S into two pieces Sı and S2 and 
evaluate STs, E -dS and Ss, E - dS separately.) 


. Let the velocity field of a fluid be described by F = yi 
(measured in meters per second). Compute how many 
cubic meters of fluid per second are crossing the surface 
x 4227=10<y<10<x<1. 


x74 y24z = 4R? 


Restaurant 


Side view 


figure 7.6.11 Restaurant plans. 


*The solution to this problem may be somewhat time-consuming. 


9. Evaluate ff; (V x F)-dS, where S is the surface 


x? + y? + 3z? = 1, z < 0 and F is the vector field 
F = yi — xj + zx3y?k. (Let n, the unit normal, be 
upward pointing.) 


10. Evaluate ff (V x F)- dS, where F = (x? + y — 4)i+ 


3xyj + (2xz + z?)k and S is the surface x? + y? + 
z? = 16, z > 0. (Let n, the unit normal, be upward 
pointing.) 


11. Calculate the integral ff F- dS, where S is the entire 


surface of the solid half ball x? + y? +z? <1,z>0, 
and F = (x + 3y°)i+(y + 10xz)j + (z — xy)k. (Let S 
be oriented by the outward-pointing normal.) 


12.* A restaurant is being built on the side of a mountain. 


The architect's plans are shown in Figure 7.6.11. 


(a) The vertical curved wall of the restaurant is to be 
built of glass. W hat will be the surface area of this 
wall? 


To be large enough to be profitable, the consulting 
engineer informs the developer that the volume of 
the interior must exceed zR*/2. For what R does 
the proposed structure satisfy this requirement? 


During atypical summer day, the environs of the 
restaurant are subject to a temperature field given by 


(b 


(c 


T(x, y, Z) = 3x2 + (y — R)? + 1622. 


A heat flux density V = —k VT (k is a constant 
depending on the grade of insulation to be used) 
through all sides of the restaurant (including the top 
and the contact with the hill) produces a heat flux. 


Top view 


13. 


14. 


15. 


16. 


17. 


18. 
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W hat is this total heat flux? (Your answer will 
depend on R and k.) 


Find the flux of the vector field V(x, y, z) = 3xy*i + 
3x2yj + 22k out of the unit sphere. 


Evaluate the surface integral ff; F -nd A, where 
F(x, y, z) =i+j+2(x2 + y2)2k and S is the surface of 
the cylinder x? + y2<1,0<z<1. 


Let S be the surface of the unit sphere. Let F be a vector 
field and Fr its radial component. Prove that 


27 ud 
ffres- f | F, sing dedo. 
S 6=0 Jġ=0 


W hat is the corresponding formula for real-valued 
functions f? 


Prove the following mean-value theorem for surface 
integrals: If F is a continuous vector field, then 


JF nas =0F1)-mQuia(s) 
S 


for some point Q €e S, where A(S ) is the area of S. 
[HINT: Prove it for real functions first, by reducing the 
problem to one of a double integral: Show thatif g > 0, 


then 
1 fa aa =t) ff goa 
D D 


for some Q e D (do it by considering 
(Sfo f9dA)/( ffa 9 dA) and using the intermediate- 
value theorem).] 


Work out a formula like that in Exercise 15 for 
integration over the surface of a cylinder. 


Let S bea surface in IR? that is actually a subset D of the 
xy plane. Show that the integral of a scalar function 

f (x, y, z) over S reduces to the double integral of 

f(x, y, z) over D. What does the surface integral of a 


19. 


20. 


21. 


22. 
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vector field over S become? (M ake sure your answer is 
compatible with Example 6.) 


Let the velocity field of a fluid be described by 

F =i + xj + zk (measured in meters per second). 
Compute how many cubic meters of fluid per second are 
crossing the surface described by 

x24 y2472=1,2>0. 


(a) A uniform fluid that flows vertically downward 
(heavy rain) is described by the vector field 

F(x, y, z) = (0, 0, —1). Find the total flux through 
the cone z = (x2 + y?) 1/2, x? + y? <1, 

The rain is driven sideways by a strong wind so that 
it falls at a 45° angle, and it is described by 

F(x, y, z) = —(/2/2, 0, 2/2). Now what is the 
flux through the cone? 


(b 


Fora > 0,b > 0,c > 0, let S be the upper half ellipsoid 


= foya 


with orientation determined by the upward normal. 
Compute ffs F- dS where F(x, y, z) = (x3, 0, 0). 


2 2 2 
X yo z 
tir t aga zeol, 


If S is the upper hemisphere {(x, y, z) | x2 + y2 + 
z? = 1, z > 0} oriented by the normal pointing out of 
the sphere, compute ff F -dS for parts (a) and (b). 


(a) F(x, y, 2) = xi+ yj 

(b) F(x, y, 2) = yit xj 

(c) For each of these vector fields, compute 
Jf, (V xF). dS and fo F- ds, where C is the unit 
circle in the xy plane traversed in the 
counterclockwise direction (as viewed from the 
positive z axis). (Notice that C is the boundary of S. 
The phenomenon illustrated here will be studied 
more thoroughly in the next chapter, using Stokes’ 
theorem.) 


7.7 Applications to Differential Geometry, Physics, and Forms of Life* 


In the first half of the nineteenth century, the great G erman mathematician K arl Friedrich 
Gauss developed a theory of curved surfaces in IR?. M ore than a century earlier, Isaac 
Newton had defined a measure of the curvature of a space curve, and Gauss was able 
to find extensions of this idea of curvature that would apply to surfaces. In so doing, 
Gauss made several remarkable discoveries. 


*This section can be skipped on a first reading without loss of continuity. 
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Curvature of Surfaces 


For paths c: [a, b] — R? that have unit speed—thatis, |c'(t) || = 1— the curvature « of 
the image curve «(¢(t)) at the point c(t) is defined to be the length of the acceleration 
vector. Thatis, ||¢”(t) || = «(¢(t)). For paths cin space, the curvature is a true measure 
of the curvature of the geometric image curve C . As we saw at the end of Section 7.1, 
the “total curvature” f x ds over C has “topological” implications. The same, and even 
more, will hold for Gauss’ definition of the total curvature of a surface. We begin with 
some definitions. 
Let ®: D — R? be a smooth parametrized surface. Then, as we know, 


_ ae 
~ ðu 


_ a 


T 2 
i ðv 


and T, 


are tangent vectors to the image surface S = ®(D) at the point ®(u, v). We will also 
assume that there is a well-defined normal vector; that is, we assume the surface is 
regular: Ty x T, 40. 

Let 
a’ I 2 


2 
o 
7 | | E 
ðu ðu dv 


a® 
G = || — 
|= 


In Exercise 23 of Section 7.5, we saw that 
Ty x TI? =EG —F?. 


For notational reasons, we denote EG - F * by W. Furthermore, we let 


T, xT, Ty xT, 


-uxt (VW 


denote the unit normal vector to the image surface at p = ®(u, v). Next we will define 
two new measures of the curvature of a surface at p— the “Gauss curvature,” K (p), 
and the “mean curvature,” H ( p). Both of these curvatures have deep connections to the 
curvature of space curves, which illuminate the meaning of their definitions, but we do 
not explore these here. 

To define these two curvatures, we first define three new functions £, m,n on S as 


follows: 
2 
ep) = Nu, v) - a = N(u, v) -Bw 
2 
m(p) = N(u, v) + -© = N(U, v) Oy (1) 
2 
n(p) = N(u, v) 2 * =N(u, v) -®p. 
Ov 


The Gauss curvature K ( p) of S at p is given by 


én — m? 


K ( p) wo 
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and the mean curvature H ( p) of S at p is defined by! 


_ Gl+En=2F m 


H(p) zW i 


(3) 


where the right-hand sides of both expressions are calculated at the point p = ®(u, v). 


example | Planes Have Zero Curvature Let ®(u,v) = wu + Bu + y, (u, v) € R?, where 
a, B, y are vectors in R?. According to Example 1 of Section 7.3, this determines a 
parametrized plane in R?. Show that at every point, both the Gauss and mean curvatures 
are zero, and hence K and H vanish identically. 


solution Because ®,, = ®,, = ®,, = 0, the functions £, m,n vanish everywhere, and so 
do H and K. Thus, a plane has “zero” curvature. Hence, at least in this example, 
we ought to be convinced that H and K actually do measure the flatness of the plane. 
Conversely, we can show thatif H and K vanish identically, then S is part of a plane (see 
Exercise 12). A 


example 2 | Curvature of a Hemisphere Let 


(u, v) = (u, v, glu, v)), 


where g(u, v) = ~R? — u? — v? is a parametrization of the “upper hemisphere” of 
radius R . Show that the Gauss curvature at every point is 1/R* and the mean curvature 
is1/R. 


solution | Wemust first calculate the following quantities: 
Tu, To Tu X Ta, Puu, ®,,, Piv, E; G, Fel m, n. 


First of all, we have 


r u 
6, =T, =i ak 
=T, <j » k. 
J/R2 — v2 — v2 
From formula (2) in Section 7.3, we have 
ag, ag 
TxT,=-—i- — 
ae du ave 
E v . k 
of RPS? =e Jawa” ' 
Therefore, 
u2 R2? — v2 
2 
E = ital Sa Rue R= 
R? — u? 
2 
G =|®,|° = R2 (2 a4 
uv 
id i ee Te 


Technically speaking, K ( p) and H ( p) could, in principle, depend on the parametrization ® of S, 
but we can show that they are, in fact, independent of ®. 
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From Exercise 23 of Section 7.5, we know that 

(R? — v2)(R2 — u?) — u2v? 
(R2? — u? — v2}? 

R4 — R2yu2 — R2y2 R2 


IT, x T,|? =EG — F? = 


(R2? — u? — y2)2 (R2 — u? — v2) 
Now a direct calculation shows that 
R? — v? 
Pw = (R? —u2 — v2)3/2 
R2 — u2 
Pw = (R2 —u2 — v2)3⁄2 
Uv 
Pw = RI ye ; 
Furthermore, 
N T, xT, _ uxt, 
(Te xT (WW 
ee es u ; v iik 
~ R “ita” Raa 
1 
— ` (yi i 2_ y2_ y2k). 
= (ui+ of + VR u vk) 
Thus, 
1 R? — v? 
EE (r) 
1 R? — u? 
n=N-6,, = 
Cd) 
1 uv 
M leas Corea 
Therefore, 


m m1 (R? — v?) (R? — u?) — u20? 
~ R2 (R2 — u? — v2)2 
1 
= R? uy 


Dividing this by W yields K = 1/R?. Thus, the Gauss curvature does not change from 
point to point on the hemisphere; that is, it is constant. This conforms to our intuition 
that the sphere is perfectly symmetrical and that its curvature is everywhere equal. 
Hence, the mean curvature should also be constant. This is verified by the following 


figure 7.7.1 Soap bubble 
formation; H = constant. 


figure 7.7.2 A helicoid, H =0. 
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calculation: 


4 Gé+En—2Fm 
z 2W 


1 R2-u2 \1/ Rè- 
-l (aee) ilea) 
+( R?— v? Jal R? — u? ) 5 ey? } 
R2_—y2—py2/ R\R2_-u2—ve (R2 —u2— v2)? 
1 R 1 
=y ieoa) A 


Surfaces of Constant Curvature 


Surfaces of constant Gauss and mean curvature are of great interest to mathemati- 
cians. It was known in the nineteenth century that the only closed and bounded smooth 
surfaces with “no boundary” and with constant Gauss curvature were spheres. In the 
twentieth century, the Russian mathematician Alexandrov showed that the only closed 
and bounded smooth surfaces without a boundary that do not intersect themselves and 
that have constant mean curvature must also be spheres. M athematicians believed that 
Alexandrov’s result held even if the surface was allowed to intersect itself, but no one 
could find a proof. In 1984, Professor Henry Wente (Toledo, Ohio) startled the world 
by finding a self-intersecting torus of constant mean curvature. 

Surfaces of constant mean curvature are physically relevant and occur throughout 
nature. Soap bubble formations have constant nonzero mean curvature (See Figure 7.7.1), 
and soap film formations (containing no air) have constant mean curvature zero (see 
Figures 7.7.2 and 7.7.3). 

In the early nineteenth century, the French mathematician Delaunay discovered all 
surfaces of revolution that have constant mean curvature: the cylinder, sphere, catenoid, 
unduloid, and nodoid. The catenoid exists as a soap film surface spanning two circular 
contours. 


Optimal Shapes in Nature 


Throughout the ages, people have speculated on why things are shaped the way they 
are. W hy are the earth and the stars “round” and not cubical? W hy are life forms shaped 
the way they are? 


figure 7.7.3 A soap film, H =0, 
spanning two circular wires; this 
one is the catenoid. 
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Ku Slime? G 
a 


w CN S figure 7.7.4 Surfaces of 


revolution of constant mean 
curvature as unicellulars. 


< NIA 


In 1917, the British natural philosopher D’A rcy Thompson published a provocative 
work entitled On Growth and Form, in which he investigated the forces behind the 
creation of living forms in nature. He wrote: 


In an organism, great or small, itis not merely the nature of the motions of the living 
substance which we must interpret in terms of force (according to kinetics), but also the 
conformation of the organism itself, whose permanence or equilibrium is explained by the 
interaction or balance of forces, as described in statics. 


Surprisingly, Thompson discovered all of Delaunay’s surfaces in the form of unicellular 
organisms (see Figure 7.7.4). The constant mean curvature of these organisms can be 
explained by minimum principles similar to those described in the Historical Note in 
Section 3.3. In 1952, Watson and Crick determined that the structure of DNA is that of a 
double helix, a discovery that set the stage for the genetic revolution. We know from soap 
films, as in Figure 7.7.2, that nature likes helicoid forms, and nature tends to repeat pat- 
terns. A better understanding of the scientific principles underlying life may ultimately 
help mathematics play a more prominent role in this area of theoretical biology. 


Curvature and Physics 


The theory of curved surfaces, initiated by Gauss, has had a profound effect on physics. 
Gauss realized that the Gauss curvature K of asurface depended only on the measure of 
distance on the surface itself; that is, curvature was intrinsic to the surface. This is not 
true of the mean curvature H . Thus, beings “living” on the surface would be able to tell 
that the surface was curving, without any reference to an “external” world. Gauss himself 
found this mathematical result to be so striking that he named it theorema egregium, or 
“remarkable theorem.” Gauss’ theory was generalized by his student B ernhard Riemann 
to n-dimensional surfaces for which one could describe a notion of curvature. 

Recall that N ewton created the idea of a gravitational force acting over vast galactic 
distances, pulling galaxies together as well as pushing them apart (see Figure 7.7.5). In 
the early 1900s, Albert Einstein used Riemann’s ideas to develop the general theory of 
relativity, a theory of gravitation that eliminated the need to consider forces (as Newton 
did) acting over great distances. Einstein's theory explained the bending of light by the 
sun, black holes, the expansion of the universe, the formation of galaxies, and the Big 
Bang itself. For most applications, including the dynamics of our solar system, N ewton’s 
theory suffices and is commonly used today by NASA to plan space missions, as we saw 
in Section 4.1. But for cosmological applications on the grand scale, Einstein's theory 
replaced that of Isaac Newton, published in his Principia in 1687. 

As a testament to his genius, and despite the astounding success of this theory, 
Newton was nevertheless disturbed by questions about how this gravitational force 


figure 7.7.5 The Andromeda 
Galaxy. It will collide with the 
Milky Way in roughly 2 billion 
years. 


figure 7.7.6 Albert Einstein (1879- 
1955) at his desk in the Patent 


Office, Bern, 1905. 
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acted. He could give no other explanation than to say, “| have not been able to deduce 
from phenomena the reason for these properties of gravitation, and | do not invent 
hypotheses; for anything which cannot be deduced from phenomena should be called 
an hypothesis.” M oreover, in a letter to his friend, Richard Bentley, N ewton wrote: 


That gravity should be innate, inherent and essential to matter, so that one body may act 
upon another at a distance, through which their action may be conveyed from one to 
another, is to me so great an absurdity that | believe no man, who has in philosophical 
matters a competent faculty of thinking, can ever fall into it. 


Newton coined the term action at a distance (which means “force acting at a dis- 
tance”) to describe the mysterious effect of gravitation over large distances. This effect 
is as difficult to understand today as it was in N ewton’s time. 

Johann Bernoulli found it difficult to believe in the concept of a force that acts through 
a vacuum of space over distances of even hundreds of millions of miles. He viewed 
this force as a concept revolting to minds unaccustomed to accepting any principle in 
physics, save those that are incontestable and evident. A dditionally, Leibniz considered 
gravitation to be an incorporeal and inexplicable power, philosophically void. 

It was perhaps Albert Einstein's greatest inspiration (see Figure 7.7.6) to replace 
Newton's model of gravitation with a model that would have thrilled the early Greeks— a 
geometric model of gravitation. In Einstein's theory, the concept of a force acting through 
great distances has been replaced by the curvature of aspace-time?® world. As the quote 
at the beginning of the chapter illustrates, W. K. Clifford had a premonition of events to 
come! In order to elucidate Einstein’s scheme, we shall present an oversimplified model 
that conveys some of his basic ideas. 

We represent space by a surface that we imagine as an originally flat trampoline (the 
vacuum state), which is at some point strongly deformed by the weight of a gigantic 
steel ball (the sun). A tiny steel ball rolling on the trampoline is our planet Earth (see 
Figure 7.7.7) 

If we roll the small steel ball across the flat trampoline, it will travel in a straight-line 
path. However, if we now place the gigantic steel ball in the center, it will cause the 


165 pace-time is locally like Rt with three space coordinates and one time coordinate. 
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(b) (c) 


figure 7.7.7 (a) A particle on a taut trampoline moves in a straight line. (0) A heavy steel ball 
distorts the trampoline. (c) A particle on the distorted trampoline follows a curved path. 


trampoline to bend, or “curve,” even “far away” from the large ball. If we then give 
our little ball a push, it will no longer travel in a straight line but in a curved path. The 
big ball affects the trajectory of the little ball by curving the space around it. With just 
the right push, the little ball might even orbit the big one for a while. This trampoline 
model explains how a large body could, by curving space, influence a small one over 
great distance. 

Einstein stated that space-time is curved by matter and energy. In this curved space- 
time, even light rays are bent as they pass near massive objects like our sun. Thanks 
to Gauss and Riemann, the curvature of space-time requires no external “universe” in 
which it curves. M oreover, in Einstein’s curved world, light travels along shortest paths 
in space-time called geodesics. It is normally impossible to observe light rays close to 
the bright Sun, but a solar eclipse provides a marvelous opportunity for such measure- 
ments. Two British expeditions to New Guinea (under Eddington and Cottingham) and 
to Sobral in northern B razil used the solar eclipse of M ay 29, 1919, to observe whether 
light rays coming from stars and passing close to the Sun were bent. B oth expeditions 
were able to confirm Einstein's prediction, and Eddington later wrote, 


Oh leave the Wise our measures to collate; 

One thing, at least, is certain: LIGHT has WEIGHT. 

One thing is certain, and the rest debate: 

Light-rays, when near the Sun, DO NOT GO STRAIGHT. 


The equations that tell one how much space and time are curved by matter and energy 
are known as Einstein’s field equations. A description of them is beyond the scope of 
this book, but the mathematical kernel from which these equations arise is not; this 
kernel is based on another remarkable result of the research of Gauss and Bonnet. 


Gauss—Bonnet Theorem 


In Example 2, we computed the Gauss curvature K of the sphere x? + y? +z? = R? of 
radius R and found it to be the constant 1/R*. The Gauss curvature K is ascalar-valued 
function over the surface, and as such we can integrate it over the surface. We wish to 
consider a constant times this surface integral, namely, 


1 
= fK a. 


For the sphere of radius R, this quantity becomes 


1 An R? 
vent ff = sep =? 
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figure 7.7.8 A deformed sphere. 
1/2n ff KAA=2. 


Handle 


+ 
figure 7.7.9 Gluing a handle to 
a sphere to obtain a torus. - 


Torus 


W hat Gauss and Bonnet discovered was that if S is any “sphere-like” closed surface 
(closed and bounded, but with no boundary, as in Figure 7.7.8), then 


1 


still holds.?” 
Thus, the integral 


1 
x ff Ka 


always equals the integer 2, and is therefore a topological invariant of the surface. T hat 
the integral of curvature should be an interesting quantity should be already clear from 
the discussion at the end of Section 7.1. 

Now consider a torus, or doughnut. The torus can be considered as coming from the 
sphere by cutting out two discs and gluing in a handle (see Figure 7.7.9). 

Moreover, we can continue this process adding 1, 2, 3, ..., g handles to the sphere. 
If g handles are attached, we call the resulting surface a surface of genus g, as in 
Figure 7.7.10. Notice that the torus has genus 1. 

If two surfaces have a different genus, they are topologically distinct, and thus can- 
not be obtained from one another by bending or stretching. Interestingly, even two 
surfaces with the same genus can sit in space in quite different and complex ways, as 
in Figure 7.7.11. Astonishingly, even though the integral (or total curvature) given by 


17R oughly speaking, this means that S can be obtained from the sphere by bending and stretching (like 
with a balloon) but not tearing (the balloon bursts!). 
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figure 7.7.10 A sphere with O, 1, 
2,3 handles attached. 


figure 7.7.11 Two manifestations 
of a surface Sin R? of genus 2. 


figure 7.7.12 David Hilbert 
(1862-1943) was a leading 
mathematician of his time. 


2-8 
DE 


Simple double doughnut Baker’s pretzel 


(1/27) ffs K dA depends on the genus, it does not depend on how the surface sits in 
space (and thus not on K ). 
Gauss and Bonnet proved that 


=f K dA =2—2g. 
27 S 


Thus, for the sphere (g = 0), itis always 2 (already verified); for the torus, it is always 
0 (see Exercise 10). 

There is something even more remarkable connected to the theorem of Gauss- 
Bonnet, observed by the great German mathematician David Hilbert (Figure 7.7.12). 

Hilbert observed that the Gauss-Bonnet theorem is, in effect, a two-dimensional 
version of Einstein's field equations. In the physics literature, this fact is known as 
Hilbert's action principle in general relativity.!® Not surprisingly, similar geometric 
ideas are being employed by contemporary researchers in an effort to unify gravity and 
quantum mechanics— to “quantize” gravity, so to speak. 


18See C. Misner, K. Thorne, and A. Wheeler, Gravitation, Freeman, New York, 1972. 
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exercises 


1. The helicoid can be described by 
®(u, v) = (u cos v, u sin v, bv), where b + 0. 


Show that H = 0 and that K = —b2/(b? + u2). In 
Figures 7.7.1 and 7.7.5, we see that the helicoid is 
actually a soap film surface. Surfaces in which H = 0 
are called minimal surfaces. 


2. Consider the saddle surface z = xy. Show that 


—1 
~ (Tx + ye 
and that 

ee: Meee 
(1+ x2 + y2)3/2" 


3. Show that ®(u, v) = (u, v, log cos v — log cosu) has 


mean curvature zero (and is thus a minimal surface; see 


Exercise 1). 


4, Find the Gauss curvature of the elliptic paraboloid 
2 2 
X y 
5. Find the Gauss curvature of the hyperbolic paraboloid 
x2 y 
2= qt p 
6. Compute the Gauss curvature of the ellipsoid 
L y z 


atarta=h 


7. After finding K in Exercise 6, integrate K to show that: 


z [fk aa=2 
2n S 


8. Find the curvature K of: 


(a) the cylinder ®(u, v) = (2 cos v, 2sinv, u) 
(b) the surface @(u, v) = (u, v, u?) 


9. Show that E nneper’s surface 


u3 2 v? 2 2 72 
auo) = (u- Ftv- Eu v, US — v 


is a minimal surface (H = 0). 


10. Consider the torus T given in Exercise 4, Section 7.4. 
Compute its Gauss curvature and verify the theorem of 
Gauss- B onnet. [HINT: Show that 
Te x Tọll? = (R + cos ø)? and 
K =cos¢/(R + cos¢).] 


11. Let (u, v) = (u, h(u) cos v, h(u) sin v), h > 0, bea 
surface of revolution. Show that 
K = —h”/h{1 + (h')?}. 


12. A parametrization ® of a surface S is said to be 
conformal (see Section 7.4), provided that E = G, F = 
0. Assume that ® conformally parametrizes S.19 Show 
that if H and K vanish identically, then S must be part 
of a plane in R3. 


review exercises for chapter 7 


1. Integrate f(x, y, z) = xyz along the following paths: 
(a) elt) = (e cost, et sint, 3), 0 < t < 2x 
(b) elt) = (cost, sint, t), 0 <t < 2x 
(c) elt) = 3t7i+ 2t47+tk,O<t<1 
(d) elt) = ti + (1/V2t?j + tk, 0 <t <1 


2. Compute the integral of f along the path c in each of the 
following cases: 


(a) f(x,y,z) =x + y+ yz;c(t) = (sint, cost, t), 
0<t<2r 


(b) f(x, y, z) =x + cos? z; elt) = (sint, cost, t), 
0<t<2r 


19Gauss proved that conformal parametrization of a surface always exists. The result of this exercise remains valid even if ® is not conformal, 


but the proof is more difficult. 


424 


10. 


11. 


12. 


Integrals Over Paths and Surfaces 


(c) f(x y, 2) =x +y +z;elt) = (t, t?, 23), 
0<t<1 


. Compute each of the following line integrals: 


(a) Jc (sin ax) dy — (cos xy) dz, where C is the triangle 
whose vertices are (1, 0, 0), (0, 1, 0), and (0, 0, 1), 
in that order 

(b) fe (sinz) dx + (cosz) dy — (xy)4/3 dz, where C is 
the path c(@) = (cos? 4, sin? 6, 0), 0 < 0 < 77/2 


ı If F(x) is orthogonal to ¢’(t) at each point on the curve 


x = c(t), what can you say about Je F -ds? 


. Find the work done by the force 


F(x, y) = (x2 — y2)i + 2xyj in moving a particle 
counterclockwise around the square with corners 
(0, 0), (a, 0), (a, a), (0,a),a > 0. 


. A ring in the shape of the curve x2 + y? = a? is formed 


of thin wire weighing |x| + Iy] grams per unit length at 
(x, y). Find the mass of the ring. 


» Find a parametrization for each of the following surfaces: 


(a) x2? +y? +z? — 4x — 6y = 12 
(b) 2x? +y? +z? -8x=1 
(c) 4x2 +9y? — 22? =8 


. Find the area of the surface defined by 


®: (u, v) = (xX, y, z), where 


x =h(u,v) =u +v, 
z= f(u, v) =v; 


y =g(u, v) =u, 


0<u<1,0< v< 1. Sketch. 


. Write a formula for the surface area of 


®: (r, 0) > (x, y, z), where 


X =r cosé, y =2rsind, z=; 


0 <r <1,0 <6 < 2m. Describe the surface. 


Suppose z = f(x, y) and 

(af /ax)* + (af /ay)? = c, c > 0. Show that the area of 
the graph of f lying over a region D in the xy plane is 
»/1+c times the area of D. 


Compute the integral of f(x, y, Z) = x? + y? + z? over 
the surface in Review Exercise 8. 


Find ff; f dS in each of the following cases: 


(a) f(x, y,z) = x; S is the part of the plane 
X + y + z= lin the positive octant defined by 
x>0,y>0,z>0 


13. 


14, 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


(b) f(x,y,z) = x2; S is the part of the plane x = z 
inside the cylinder x? + y? = 1 


(c) f(x, y, z) = x; S is the part of the cylinder 


x2 + y? = 2x with 0 < z < \/x2+4 y? 


Compute the integral of f(x, y, z) = xyz over the 
rectangle with vertices (1, 0, 1), (2, 0, 0), (1, 1, 1), and 
(2,1, 0). 


Compute the integral of x + y over the surface of the 
unit sphere. 


Compute the surface integral of x over the triangle with 
vertices (1, 1, 1), (2, 1, 1), and (2, 0, 3). 


A paraboloid of revolution S is parametrized by 
®(u, v) = (u cosv, u sin v, u2), 0 < u < 2,0 < v < 2x. 


(a) Find an equation in x, y, and z describing the 
surface. 


(b 


W hat are the geometric meanings of the parameters 
u and v? 


(c) Find a unit vector orthogonal to the surface at 


(u, v). 
(d 


Find the equation for the tangent plane at 
(Uo, vo) = (1, 1, 2) and express your answer in the 
following two ways: 


(i) parametrized by u and v; and 


(ii) in terms of x, y, and z. 


(e) Find the area of S. 


Let f(x, y, Z) = xe” cos zz. 
(a) Compute F = Vf. 


(b) Evaluate fe F -ds, where 
c(t) = (3cos*t, 5sin’t, 0),0<t < x. 


LetF(x, y, z) = xi + yj + zk. Evaluate ff; F- dS, 
where S is the upper hemisphere of the unit sphere 
x2 +y? +z? =L. 


Let F(x, y, z) = xi + yj + zk. Evaluate f, F- ds, where 
c(t) = (e,t, t?),0<t <1. 


Let F = Vf for a given scalar function. Let c(t) bea 
closed curve, thatis, c(b) = c(a). Show that 

J .F-ds=0. 

Consider the surface ®(u, v) = (u? cos v, u? sin v, u). 
Compute the unit normal atu = 1, v = 0. Compute the 
equation of the tangent plane at this point. 


22. 


23. 


24. 


25. 


26. 


27. 


Let S be the part of the cone z2? = x? + y? with z between 
1 and 2 oriented by the normal pointing out of the cone. 
Compute ff, F- dS, where F(x, y, z) = (x?, y?, z?). 


Let F = xi + x2j + yzk represent the velocity field of a 
fluid (velocity measured in meters per second). Compute 
how many cubic meters of fluid per second are crossing 

the xy plane through the square 0 < x < 1,0 < y <1. 


Show that the surface area of the part of the sphere 
x? + y? + z? = 1 lying above the rectangle 
[—a, a] x [—a, a], where 2a? < 1, in the xy plane is 


a a 
A=2] sin} dx 
L (Ss ) 


Let S bea surface and C a closed curve bounding S. 
Verify the equality 


[ftv xtias= [reas 
S Č 


if F is a gradient field (use Review Exercise 20). 


Calculate ff F- dS, where F(x, y, z) = (x, y, —y) and 
S is the cylindrical surface defined by x2 + y? = 1, 
0 <z <1, with normal pointing out of the cylinder. 


Let S be the portion of the cylinder x2 + y? = 4 between 
the planes z = 0 and z = x + 3. Compute the following: 


(a) ff, x? ds 
(b) ff, y?as 
(c) ff, z?ds 


28. 


29. 
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Let T be the curve of intersection of the plane 

z = ax + by, with the cylinder x2 + y? = 1. Find all 
values of the real numbers a and b such that a2 + b? = 1 
and 


[vex+(e-way-yar=0. 
E 


A circular helix that lies on the cylinder x2 + y? = R? 
with pitch p may be described parametrically by 


x = Rosé, y = Rsinð, Z= pd, 6>0. 


A particle slides under the action of gravity (which acts 
parallel to the z axis) without friction along the helix. If 
the particle starts out at the height zg > 0, then when it 


reaches the height z, 0 < z < Zo, along the helix, its 
speed is given by 


ds 
x= y (Zo — z)29, 


where s is arc length along the helix, g is the constant of 
gravity, and t is time. 


(a) Find the length of the part of the helix between the 
planes z = Z9 and z = 21,0 < Z1 < Z0. 


(b) Compute the time To it takes the particle to reach the 
planez = 0. 
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Tne Integral Ineorems 
of Vector Analysis 


From a long view of the history of mankind—seen from, say, ten thousand years from 
now—there can be little doubt that the most significant event of the nineteenth century will 
be judged as Maxwell's discovery of the laws of electrodynamics. The American Civil War 
will pale into provincial insignificance in comparison with this important scientific event of 


the same decade. —Richard Feynman 


The Special Theory of Relativity owes its origins to Maxwell's Equations... 


—Albert Einstein 


We are now prepared to tie together vector differential calculus and 
vector integral calculus. This will be done by means of the important the- 
orems of Green, Gauss, and Stokes. We shall also point out some of the 
physical applications of these theorems to the study of gravitation, elec- 
tricity, and magnetism. 

The basic integral theorems in vector analysis had their origins in ap- 
plications. For example, Green's theorem, discovered about 1828, arose 
in connection with potential theory (this includes gravitational and elec- 
trical potentials). Gauss’ theorem—the divergence theorem—arose in 
connection with the study of capillarity (this theorem should be jointly 
credited to the Russian mathematician Ostrogradsky, who discovered 
the theorem around the same time as Gauss). Stokes’ theorem was first 
suggested in a letter to Stokes from the physicist Lord Kelvin in 1850 and 


was used by Stokes on the examination for the Smith Prize in 1854. 
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8.1 Green’s Theorem 


figure 8.1.2 Two examples 
showing how to break the 
positively oriented boundary of a 
y-simple region D into oriented 
components. 


Green's theorem relates a line integral along a closed curve C in the plane R? to a 
double integral over the region enclosed by C . This important result will be generalized 
in the following sections to curves and surfaces in R?. We shall be referring to line 
integrals around curves that are the boundaries of elementary regions (see Section 5.3). 
To understand the ideas in this section, you may also need to refer to Section 7.2. 


Simple and Elementary Regions and Their Boundaries 


A simple closed curve C that is the boundary of an elementary region has two ori- 
entations— counterclockwise (positive) and clockwise (negative). We denote C with 
the counterclockwise orientation as C+, and with the clockwise orientation as C- 
(Figure 8.1.1). 

The boundary C of a y-simple region can be decomposed into bottom and top 
portions, Cı and C3, and (if applicable) left and right vertical portions, B4 and B3. 
Following Figure 8.1.2, we write, 


C*¥=C{P+B74+C7+87, 


where the pluses denote the curves oriented in the direction of left to right or bottom 
to top, and the minuses denote the curves oriented from right to left or from top to 
bottom. 

We can make a similar decomposition of the boundary of an x-simple region into 
left and right portions, and upper and lower horizontal portions (if applicable) 
(Figure 8.1.3). 

Similarly, the boundary of a simple region has two decompositions: one into upper 
and lower halves, the other into left and right halves. 


ct C 
figure 8.1.1 (a) Positive orientation 
of C, and (b) negative orientation 
of C. 
Positive orientation Negative orientation 
(a) (b) 
J Ct= Ct BS +C7+B7 Y Cha Cf+ Bt + Cy 


y= o2() 


\ 


y =o) (x) 
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C* = C}+ B3 + C7 +B} 


figure 8.1.3 An example showing how to break 
the positively oriented boundary of an x-simple 
region D into oriented components. 


Green’s Theorem 
We shall now prove two lemmas in preparation for G reen’s theorem. 


Lemma | Let D bea y-simple region and let C be its boundary. Suppose 
P: D — Ris of class C 1. Then 


ap 
pdx = — [f Z xoy. 
c+ p oy : 


(The left-hand side denotes the line integral fe, P dx + Q dy, where Q = 0.) 


proof Suppose D is described by 
asx<b — ¢(x) <y < g(x). 
We decompose C * by writing C+ = C} + BY + C3 + By (see Figure 8.1.2). By 


Fubini’s theorem, we can evaluate the double integral as an iterated integral and then 
use the fundamental theorem of calculus: 


ap b  pb2(x) aP 
pi sy Ce axdy = f f pyy dx 
b 
a LP (x, d2(x)) — P (x, r(x) ] dx. 


However, because C * can be parametrized by x +> (x, ġı(x)),a < x < b, and C * can 
be parametrized by x +> (x, @2(x)),a < x < b, we have 


b 
P(x, dx = P(x, y)d 
f (x, r(x) dx L (x, y) dx 


and 


b 
f P(x, #2(x)) x= f Pande 


430 


The Integral Theorems of Vector Analysis 


Thus, by reversing orientations, 


- [Pt giana =f P (x, y) dx. 
Hence, 
Jf Z=- [ P dx — P dx. 
D dy ce ce 
Because x is constant on B+ and BĮ, we have 
Pdx=0= P dx, 


By By 


sO 


pax= | Pax f Pax f P dx + Pies] P dx + P dx. 
c+ Cy Bs CF Br Cr Cs 


Thus, 
oP 
— dxd =- f P dx — P dx = — P dx. 
JI. əy : Ct é c+ z 


We now prove the analogous lemma with the roles of x and y interchanged. 


Lemma 2 Let D be an x-simple region with boundary C. Then if Q: D > R 
ee 
aQ 
a ay= ff 9x Ox a: 


The negative sign does not occur here, because reversing the role of x and y corresponds 
to a change of orientation for the plane. 


proof Suppose D is given by 


wily) <x < yy) c<y<d. 


Using the notation of Figure 8.1.3, and noting that y is constant on B} and BS, we have 


f ow=f agy= f ody f Q dy, 
C+ Ci +B} +C}+B7 cy Cr 


where C *¥ is the curve parametrized by y +> (wWo(y), y),c < y < d, and C? is the 
curve y +> (yaly), y),¢ < y < d. Applying Fubini’s theorem and the fundamental 


figure 8.1.4 The correct 


orientation for the boundary of a 


region D. 


Historical Note 
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theorem of calculus, we obtain 


JI R ays f 1 9 aedy = f tolay) y) - Q (yaly), y)ldy 


Q dy — Qdy= f Qdy+ f Qdy= f Qay. 
cf cz 


cyt ct C+ 


Adding the results of Lemmas 1 and 2 proves the following important theorem. 


Theorem | Green's Theorem Let D be a simple region and let C be its 
boundary. Suppose P: D —> Rand Q: D — Rareof class C t. Then 


f Pax+Qdy= ff (2-5) dx dy. 


The correct (positive) orientation for the boundary curves of region D can be re- 
membered by the following device: If you walk along the curve C with the correct 
orientation, the region D will be on your left (see Figure 8.1.4). 


George Green 


George Green was born in Nottingham England in 1793 and died in 1841. No 
picture of him is known to exist. Little is known about his early life before the age 
of 30 other than that he was a self-taught man, who developed a passion for 
mathematics at an early age. In 1833, at age 40, Green enrolled as a student in 
Cambridge. Amazingly, five years earlier, he had published (at his own expense) 
his first and most famous work “An Essay on the Application of Mathematical 
Analysis to the Theories of Electricity and Magnetism.” Here he proved a theorem 
similar to the Green’s Theorem of this section and also introduced other very 
important mathematical concepts such as Green’s functions, now so ubiquitous 
in mathematical analysis. He was the first person to create a theory of electricity 
and magnetism, a theory that later formed the basis of the work of Maxwell and 
others. 
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example | 


solution 


Ci figure 8.1.5 Green's theorem applies to 
D = D uU D U D3 U Da. 


D =D,UD3UD3U Dy 


Generalizing Green’s Theorem 


Green's theorem actually applies to any “decent” region in IR’. For instance, Green's 
theorem applies to regions that are not simple, but that can be broken up into pieces, 
each of which is simple. An example is shown in Figure 8.1.5. The region D is an 
annulus; its boundary consists of two curves C = Cı + C3 with the indicated orien- 
tations. (Note that for the inner region the correct orientation to ensure the validity of 
Green's theorem is clockwise; the device in Figure 8.1.4 still works for remembering 
the orientation!) If Theorem 1 is applied to each of the regions D;, D2, D3, and D 4 and 
the results are summed, the equality of Green’s theorem will be obtained for D and its 
boundary curve C . This works because the integrals along the interior lines in opposite 
directions cancel. This trick, in fact, shows that Green’s theorem holds for virtually all 
regions with reasonable boundaries that one is likely to encounter (see Exercise 16). 

Let us use the notation aD for the oriented curve C +, that is, the boundary curve of 
D oriented in the sense as described by the device in Figure 8.1.4. Then we can write 
Green's theorem as 


i Pax+Qdy= ff (2 -Zlio 


Green's theorem is very useful becauseit relates a line integral around the boundary of 
aregion to an area integral over the interior of the region, and in many cases itis easier to 
evaluate the line integral than the area integral or vice versa. For example, if we know that 
P vanishes on the boundary, we can immediately conclude that [/, (oP /ay) dxdy = 0 
even though oP /dy need not vanish on the interior. (Can you construct such a P on the 
unit square?) 


Verify Green’s theorem for P(x, y) = x and Q(x, y) = xy, where D is the unit disc 
x +y’ <1 


We do this by evaluating both sides in Green’s theorem directly. The boundary of D is 
the unit circle parametrized by x = cost, y = sint, 0 < t < 27, and so 


2x 
i Pax+Qdy= f [(cost)(—sint) + cost sint cost] dt 
aD 0 


B zal wt o 
= 2 0 3 0 E 


example 2 
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On the other hand, 


E-E) ffraw 


which is also zero by symmetry. Thus, Green's theorem is verified in this case. A 


Areas 


We can use Green's theorem to obtain a formula for the area of a region bounded by a 
simple closed curve. 


Theorem 2 Area of a Region If C isa simple closed curve that bounds a 
region to which Green’s theorem applies, then the area of the region D bounded 


by C = aD is 
i 
= xdy — y dx. 
J) yey 


proof Let?P (x, y) =—y, Q(x, y) = x; then by Green’s theorem we have 


5 [xa - yax=5 2 P| oes 
ere m 


Leta > 0. Computethe area (see Figure 8.1.6) of the region enclosed by the hypocycloid 
defined by x2/3 + y?/3 = a? using the parametrization 


x =a cos? ð, y=asin0, 0<0<2r. 


X 


(0, a) 


(a za) 


figure 8.1.6 The hypocycloid x = acos? 6, y= asino, O < 0 < 2x. 
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solution From the preceding box, and using the trigonometric identities cos?@ + sin?@ = 
1, sin20 = 2sin6cos@, and sin? @ = (1 — cos2¢) /2, we get 


1 
= xdy — y dx 
F) y-y 


20 
= 7 [(a cos? @)(3a sin? 6 cos) — (a sin? @)(—3a cos? 6 sing) |] de 
0 


3 20 3 27 , 
= 52° f (sin? 6 cost 6 + cos? 8 sint 6) do = 52° | sin? 6 cos? 8 da 


0 0 
ee eee 3. [7 /1— cos 40 
= 3? f sin‘ 20 d0 = 3? f A dé 
-3e TE [ cos40 d8 = = 
~ 16 7 A 


Vector Form Using the Curl 


The statement of Green’s theorem can be neatly rewritten in the language of vector 
fields. As we will see, this points the way to one possible generalization to R°. 


Theorem 3 Vector Form of Green’s Theorem Let D c R? bea region 
to which Green’s theorem applies, let dD be its (positively oriented) boundary, 
and let F = P i + Qj bea C t vector field on D. Then 


| F-as= ff (cur)-k da = ff (V xF)-k dA 


(see Figure 8.1.7). 


This result follows from Theorem 1 and the fact that (V x F) -k = dQ /dx — oP /ay. 
We ask you to supply the details in Exercise 22. 


z 


figure 8.1.7 The vector form of Green's 
theorem. 


example 3 


solution 
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LetF = (xy?, y + x). Integrate (V x F) - k over the region in the first quadrant bounded 
by the curves y = x? and y = x. 


M ethod 1. We first compute the curl 


vxF=(0, 0, T Fe) = (1 - 2xy)k. 
əx ay 


Thus, (V x F)-k = 1 — 2xy. This can be integrated over the given region D (see 
Figure 8.1.8) using an iterated integral as follows: 


[fxn Karey = f f pene 


1 
== == 5 = = 
= fi x3 — x? + x°] dx = 7 ste 


figure 8.1.8 The region bounded by the curves y= x? 
and y= x. 


Method 2. Here we use Theorem 3 to obtain 


J [9 xP) -kaxay = F-ds. 
D aD 


The line integral of F along the curve y = x from left to right is 


1 1 1 5 
1 Fide+ Fady= f (x9 42x) dx= 54153. 
0 0 4 4 


Along the curve y = x* we get 
1 1 2 4 
f FidxtFaay= f XP dx +(x + x2)(2x dx) =z +3 45 = 5. 

0 0 


Thus, remembering that the integral along y = x is to be taken from right to left, as in 
Figure 8.1.8, 


4 5 


aD 
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Vector Form Using the Divergence 


There is another form of Green's theorem that can be generalized to R?. 


Theorem 4 Divergence Theorem in the Plane Let D c R? be a region 
to which Green's theorem applies and let dD be its boundary. Let n denote the 
outward unit normal to aD. If c: [a,b] > R?,t => celt) = (x(t), y(t)) isa 
positively oriented parametrization of dD, nis given by 


__ (yt), x'e) 
Jar + TOP 


(see Figure 8.1.9). Let F = P i + Qj beaC? vector field on D. Then 


Fends = / div FdA. 
aD D 


proof Recall that c(t) = (x(t), y/(t)) is tangent to oD, and note that n-c’ = 0. 
Thus, n is normal to the boundary. The sign of n is chosen to make it correspond to 
the outward (rather than the inward) direction. By the definition of the line integral (see 
Section 7.2), 


PP PC x(t), y(t) y(t) — Q (x(t), y 5 z 
Ls nds = f Jers VOT t Rt )P + y(t) dt 


By Green's theorem, this equals 


I (2+2) aay= ff aiveda. 


n 


figure 8.1.9 nis the outward unit normal to aD. 


example 4 
square. 


solution 
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Let F = y?i + x*j. Compute the integral of the normal component of F around the unit 


This can be done using the divergence theorem. Indeed 


f Fnas= | div F dA. 
oD D 


But div F = 0, and so the integral is zero. A 


exercises 


L. 


Let D bethe triangle in the xy plane with vertices at 
(—1, 1), (1, 0), and (3, 2). Describe the boundary aD as 
a piecewise smooth curve, oriented counterclockwise. 


2. 


Let D be the region in the xy plane lying between the 
curves y = x? + 4 and y = 2x2. Describe the boundary 
aD as a piecewise smooth curve, oriented 
counterclockwise. 


In Exercises 3 to 6, verify Green’s theorem for the indicated region D and boundary oD, and functions P and Q. 


3. 


4. 


10. 


11. 


D = [-1, 1] x[-1,1], P(x, y)=- 

Q(x, y) =x 

D =[-1, 1] x [-1, 1], P(x, y) =x, Q(x, y)=y 
D= [-1, 1] x [-1, 1], P(x, y) =i y, 

Q(x, y)=x+y 

[HINT: Use 3 and 4.] 

D = [0,5] x [05], P(x, y) =sinx, 

Q(x, y) = cosy 


Let C be the closed, piecewise smooth curve formed by 
traveling in straight lines between the points (—2, 1), 
(—2, —3), (1, —1), (1, 5), and back to (—2, 1), in that 
order. Use Green's theorem to evaluate the integral: 


[12x ox xy? ay 
C 


A particle travels across a flat surface, moving due east 
for 3 m, then due north for 4 m, and then returns to its 
origin. A force field acts on the particle, given by 

F(x, y) = (3x + 4y2)i + (10xy)j. (Here we assume that 
j points north.) Use G reen’s theorem to find the work 
done on the particle by F. 


Evaluate fe y dx — xdy, where C is the boundary of the 
square [—1, 1] x [—1, 1] oriented in the 
counterclockwise direction, using Green's theorem. 


Find the area of the disc D of radius R using Green’s 
theorem. 


Verify Green's theorem for the disc D with center (0, 0) 
and radius R and the functions: 


12. 


13. 


14. 


15. 


(a) P(x, y) = xy?, Q(x, y) = —yx? 

(b) P(x,y) =X+y, Q(x, y) =y 

(c) P(x, y) =xy = Q(x, y) 

(d) P(x,y) =2y, Q(x, y) =x 

Using the divergence theorem, show that 

os F -nds = 0, where F(x, y) = yi — xj and D is the 
unit disc. Verify this directly. 

Find the area bounded by one arc of the cycloid 

x =a(6—sin@), y =a(1 — cos), wherea > 0, and 
0 < 0 < 2x, and the x axis (use Green's theorem). 
Under the conditions of Green's theorem, prove that 


a f PQ dx + PQ dy = 
oP  əP Q Q 
{he (aza) ea) e 
aQ 
o f a (P ay OF ay 
2 
TE — - my) 
ax dy ax dy 


Evaluate the line integral 


[ee — y?) dx + (xX? + y?) dy, 
E 


whereC is the unit circle, and verify G reen’s theorem 
for this case. 
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16. 


17. 


18. 


19. 


20. 


21. 
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Prove the following generalization of Green’s theorem: 
Let D bea region in the xy plane with boundary a finite 
number of oriented simple closed curves. Suppose that 
by means of a finite number of line segments parallel to 
the coordinate axes, D can be decomposed into a finite 
number of simple regions D; with the boundary of each 
Dj oriented counterclockwise (see Figure 8.1.5). Then if 
P and Q areof class C t on D, 


IEG Hy )oxay= f Pox + aay 


where aD is the oriented boundary of D. (HINT: Apply 
Green's theorem to each Dj.) 


Verify Green's theorem for the integrand of Exercise 15 
(that is, with P = 2x3 — y3 and Q = x? + y3) and the 
annular region D described by a < x? + y? <b, with 
boundaries oriented as in Figure 8.1.5. 


Let D bea region for which Green's theorem holds. 
Suppose f is harmonic; that is, 
af att 
əx? əy? 
on D. Prove that 


f: ax -*y=0. 


(a) Verify the divergence theorem for F = xi + yj and 
D the unit disc x? + y? < 1. 

(b) Evaluate the integral of the normal component of 
2xyi — y?j around the ellipse defined by 
x?/a? +y? /b? =1. 


=0 


Let P(x, y) = —y/(x2 + y2) and 
Q(x, y) = x/(x2 + y2). Assuming D is the unit disc, 
investigate why Green's theorem fails for this P and Q. 


Use Green's theorem to evaluate fe, (y? + x3) dx + 
x“dy, where C + is the perimeter of the square [0, 1] x 
[0, 1] in the counterclockwise direction. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


Verify Theorem 3 by showing that 
(V x F) -k = 3Q /ax — əP /ay. 


U se Theorem 2 to compute the area inside the ellipse 
x?/a? + y?/b? = 1, 


Use Theorem 2 to recover the formula A = 4 [°r2do 
for a region in polar coordinates. 


Sketch the proof of Green's theorem for the region 
shown in Figure 8.1.10. 


figure 8.1.10 Prove 
Green's theorem for 
this region. 


Prove the identity 
f ovo-nas= f ($V? + Vo- Vo) dA. 
aD D 


Use Green's theorem to find the area of one loop of the 
four-leafed roser = 3sin 26. (HINT: 
x dy — y dx =r2d0.) 


Show thatif C is a simple closed curve that bounds a 


region to which Green's theorem applies, then the area 
of the region D bounded by C is 


a= y=- | y dx. 
aD aD 


Show how this implies Theorem 2. 


Exercises 29 to 37 illustrate the application of Green's theorem to partial differential equations. (Further applications are 
given in the Internet supplement.) They are particularly concerned with solutions to Laplace's equation, that is, with 
harmonic functions. For these exercises, let D be an open region in R? with boundary aD.Letu: D UaD + Rbea 
continuous function that is of class C? on D. Suppose p € D and the closed discs B, = B,(p) of radius p centered at p are 
contained in D for 0 < p < R. Define | (o) by 


29. 
30. 


Mol == f u ds. 
P Sas 


Show that limit,.0 I () = 2zu(p). 


Let n denote the outward unit normal to 3B p 


and ðu /ðn = Vu -n. Show that 


Í Zass f) V2u dA. 
aB, an B 


p p 


31. 


32. 


33. 


34. 


Using Exercise 30, show that 
1'(o) = (1/9) Se, Vu dA. 


Suppose u satisfies Laplace's equation: V2u = 0 on D. 
Use the preceding exercises to show that 


1 
u(p) = van |, 8° 


(This expresses the fact that the value of a harmonic 
function at a point is the average of its values on the 
circumference of any disc centered about it.) 


Use Exercise 32 to show that if u is harmonic (i.e, if 
V2u = 0), then u(p) can be expressed as an area integral 


1 
up) = spa ff 4 a. 


Suppose u is a harmonic function defined on D (i.e., 
V2u = 0 on D ) and that u has a local maximum (or 
minimum) at a point pin D. 


(a) Show that u must be constant on some disc centered 
at p. (HINT: Use the results of Exercise 25.) 


(b) Suppose that D is path-connected [i.e., for any 
points p and qin D , there is a continuous path 
c: [0,1] — D such that ¢(0) = p and ¢(1) = q] and 
that for some p the maximum or minimum at p is 
absolute; thus, u(q) < u(p) or u(q) > u(p) for 
every qin D. Show that u must be constant on D. 


(The result in this Exercise is called a strong 
maximum or minimum principle for harmonic 
functions. Compare this with Exercises 46 to 50 in 
Section 3.3.) 
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35. 


36. 


37. 


38. 
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A function is said to be subharmonic on D if V2u > 0 
everywhere in D. Itis said to be superharmonic if 
v2u < 0. 


(a) Derive a strong maximum principle for 
subharmonic functions. 


(b) Derive a strong minimum principle for 
superharmonic functions. 


Suppose D is the disc {(x, y) | x2 +y? < 1} andC is 
the circle {(x, y) | x? + y? = 1}. In the Internet 
supplement, we shall show that if f is a continuous 
real-valued function on C, then there is a continuous 
function u on D UC that agrees with f on C and is 
harmonic on D. Thatis, f has a harmonic extension to 
the disc. A ssuming this, show the following: 


(a) If q isa nonconstant continuous function on D UC 
that is subharmonic (but not harmonic) on D , then 
there is a continuous function u on D UC thatis 
harmonic on D such that u agrees with q on C and 
q < u everywhereon D. 


The same assertion holds if “subharmonic” is 
replaced by “superharmonic” and “q < u” by 


(b 


Let D beas in Exercise 36. Let f: D + R be 
continuous. Show that a solution to the equation 
V2u = 0 satisfying u(x) = f(x) forall x € aD is 
unique. 


Use Green's theorem to prove the change of variables 
formula in the following special case: 


= a(x, y) 
[hee fh. la 


dudv 
v) 
for a transformation (u, v) > (x(u, v), y(u, v)). 


Stokes’ theorem relates the line integral of a vector field around a simple closed curve 
C in R? to an integral over a surface S for which C is the boundary. In this regard it is 
very much like Green’s theorem. 


Stokes’ Theorem for Graphs 


Let us begin by recalling a few facts from Chapter 7. Consider a surface S that is the 
graph of a function f(x, y), so that S is parametrized by 


X=u 
y=v 
z= f(u, v) = f(x,y) 
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for (u, v) in some domain D in the plane. The integral of a vector function F over S 
was developed in Section 7.6 as 


[fF -as= | ff: (-=) +F? (-=) +r idy, (1) 


where F = Fii + Fj + F3k. 

In Section 8.1, we first assumed that the regions D under consideration were simple; 
while this was used in our proof of Green’s theorem, we noted there that the theorem 
is valid for a wider class of regions. In this section we assume that D is a region whose 
boundary isasimpleclosed curveand to which Green's theorem applies. Green’s theorem 
involves choosing an orientation on the boundary of D , as was explained in Section 8.1. 
The choice of orientation that validates Green's theorem will be called positive. Recall 
that if D is simple, then the positive orientation is the counterclockwise one. 

Suppose that c: [a, b] —> R?, c(t) = (x(t), y(t)) is a parametrization of aD in the 
positive direction. Then we define the boundary curve 0S to be the oriented simple 
closed curve that is the image of the mapping p: t +> (x(t), y(t), f (x(t), y(t))) with 
the orientation induced by p (Figure 8.2.1). 

To remember this orientation (i.e., the positive direction) on aS, imagine that you 
are an “observer” walking along the boundary of the surface with the normal as your 
upright direction; you are moving in the positive direction if the surface is on your left. 
This orientation on 0S is often called the orientation induced by an upward normal n. 


Theorem 5 Stokes’ Theorem for Graphs Let S be the oriented surface 
defined by a C? function z = f(x, y), where (x, y) € D, a region to which 
Green's theorem applies, and let F be aC! vector field on S. Then if aS denotes 
the oriented boundary curve of S as just defined, we have 


[feutt-as= [ftv xF)-as= f Fas 


Remember that f, F -dsis the integral around ðS of the tangential component of F, 
while ff G -dS is the integral over S of G -n, the normal component of G (see Sec- 
tions 7.2 and 7.6). Thus, Stokes’ theorem says that the integral of the normal component 
of the curl of a vector field F over a surface S is equal to the integral of the tangential 
component of F around the boundary aS. 


4 


n 
/ 
> i figure 8.2.1 The induced orientation on aS: 


As you walk around the boundary, the 
surface should be on your left. 
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proof If F =F i+ F2j + Fsk, then 
F F2\. F F3\, F oF 
curl F = ms oe i+ Fi wa j+ F OFi k 
oy OZ OZ Ox ox oy 
Therefore, we use formula (1) to write 


[hewras~ a ae) (ax) 


OF y OF 3 OZ OF 2 OF 
a[o me) ( m) + pja (2) 
On the other hand, 


[Feas= [Fos = J Fidx+ Fady + Fad 
as p p 


where p: [a, b] > R?, p(t) = (x(t), y(t), f (x(t), y(t))) is the orientation-preserving 
parametrization of the oriented simple closed curve dS discussed earlier. Thus, 


dx. dy o de 
Fass f GE +f hha E)E (3) 


By the chain rule, 
dz ozdx  azdy 
dt  axdt əy dt’ 


Substituting this expression into equation (3), we obtain 


' az\ dx dy 
[Fas= [ (Gai) at +(Fo+h Z) x dt 
= (ntr) (Feta dy (4) 
c oy 
=} (Fatra SE) ax (Fa + Fa) oy 


Applying Green’s theorem to equation (4) yields (we are assuming that Green's theorem 
applies to D) 


Jf ae To 
D ox ay l 


Now we use the chain rule, remembering that F;, F2, and F are functions of x, y, and 
z and that z is a function of x and y, to obtain 


as OF, əz  OF39Z OF 3 OZ AZ a2z 
+ + +F3 
+ oz ax | ox ay az Ox ay ax ay 
A OF, OZ əFzə3z Fz ðZ AZ 3?z 
dy | az ay ay ax az ayax  `əƏyəðx 


Because mixed partials are equal, the last two terms in each parenthesis cancel each 
other, and we can rearrange terms to obtain the integral of equation (2), which completes 
the proof. E 
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example | 


solution 


example 2 


solution 


Let F = ye*i + xe’j + xye’k. Show that the integral of F around an oriented simple 
closed curve C that is the boundary of a surface S is 0. (Assume S is the graph of a 
function, as in Theorem 5.) 


Indeed, fe F-ds= ffi (V x F)- dS, by Stokes’ theorem. But we compute 


i j k 
yvxF= 2? 2 2 26 
əx ay az 


and so fẹ F-ds = 0. Alternatively, we can observe that F = V(xye’), so its integral 
around a closed curveiszero. A 


Use Stokes’ theorem to evaluate the line integral 
| =y dx + xĉdy — z? dz, 
Ç 
where C is the intersection of the cylinder x? + y? = 1 and the plane x + y +z = 1, 
and the orientation on C corresponds to counterclockwise motion in the xy plane. 


The curve C bounds the surface S defined by the equation z = 1 — x — y = f (x, y) 
for (x, y) in the set D = {(x, y) | x? + y? < 1} (Figure 8.2.2). We set F = —y3i+ 
x3j — 23k, which has curl V x F = (3x? + 3y?)k. Then, by Stokes’ theorem, the line 
integral is equal to the surface integral 


[fev mas. 


But V x F has only a k component. Thus, by formula (1) we have 


(V xF)-dS= // (3x? + 3y?) dxdy. 
Ih Ih 


figure 8.2.2 The curve Cis the intersection of the 
cylinder x2 + y? = 1 and the plane x+ y+ z=1. 
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This integral can be evaluated by changing to polar coordinates. Doing this, we get 


1 2x 1 
3 [f+ dxdy=3 f Perdodr =x fdr =o LA 
D 0 0 0 


4 2 


Let us verify this result by directly evaluating the line integral 


1 —y? dx + x? dy — z’ dz. 
Ç 
We can parametrize the curve aD by the equations 
x = cost, y =sint, z=0, 0<t <02rn. 
The curve C is therefore parametrized by the equations 
x = cost, y =sint, 


z= 1l- sint —cost, 0<t<2r. 
Thus, 


I — y? dx + xĉ?dy — 7? dz 


Cc 


27 
=] [(—sin?t)(—sint) + (cos? t)(cost) 
0 


— (1 = sint — cost)7(—cost + sint)] dt 


2x 2m 
z (cos*t + sin‘ t) a- | (1 = sint — cost)?(—cost + sint) dt. 
0 0 


The second integrand is of the form u? du, where u = 1 — sint — cost, and thus the 
integral is equal to 


I A 
gl(1—sint — cost)*]?” = 


Hence, we are left with 


20 
i (cos*t + sinf t) dt. 
0 


This integral can be evaluated using formulas (18) and (19) of the table of integrals. 
We can also proceed as follows. Using the trigonometric identities 


1 —cos2t 
2 I 


; 1 2t 
sin? t = eee ea 


2 I 
substituting and squaring these expressions, we reduce the preceding integral to 


1 27 1 27 
a (1+ cos? 2t) t=x+3 cos? 2t dt. 
2 0 2 0 


Again using the identity cos? 2t = (1 + cos 4t)/2, we find 


1 2x 1 20 1 2x 
+5} (1+ cosdt) dt= 2+ 5 | a+; f cos 4t dt 
4 Jo 4 Jo 4 Jo 


3x 


T 
=r+53t0=5 À 
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figure 8.2.3 The surface S is a portion of a sphere. 


“Mistaken” boundary of S 


Stokes’ Theorem for Parametrized Surfaces 


To simplify the proof of Stokes’ theorem given earlier, we assumed that the surface S 
could be described as the graph of a function z = f(x, y),(x, y) € D, where D is 
some region to which Green’s theorem applies. H owever, without too much more effort 
we can obtain a more general theorem for oriented parametrized surfaces S. The main 
complication is in the definition of aS. 

Suppose ®: D — R? is a parametrization of a surface S and c(t) = (u(t), v(t)) isa 
parametrization of dD. We might be tempted to define oS as the curve parametrized by 
t p(t) = (u(t), v(t)). However, with this definition, dS might not be the boundary 
of S in any reasonable geometric sense. 

For example, we would conclude that the boundary of the unit sphere S parametrized 
by spherical coordinates in IR? is half of the great circle on S lying in the xz plane, but 
clearly in a geometric sense S is a smooth surface (no points or cusps) with no boundary 
or edge at all (see Figure 8.2.3 and Exercise 20). Thus, this great circle is in some sense 
the “mistaken” boundary of S. 

We can get around this difficulty by assuming that ® is one-to-one on all of D. 
Then the image of dD under ®, namely, ®(dD), will be the geometric boundary of 
S = (D). If c(t) = (u(t), v(t)) is a parametrization of dD in the positive direction, 
we define aS to be the oriented simple closed curve that is the image of the mapping 
p: t > (u(t), v(t)), with the orientation of aS induced by p (see Figure 8.2.1). 


Theorem 6 Stokes’ Theorem: Parametrized Surfaces Let S be an ori- 
ented surface defined by a one-to-one parametrization ®: D c R? —> S, where D 
is a region to which Green's theorem applies. Let aS denote the oriented boundary 
of S and let F be aC! vector field on S. Then 


[ftv xti-ds= | F-os 


If S has no boundary, and this includes surfaces such as the sphere, then the integral 
on the left is zero (see Exercise 25). 


This is proved in the same way as Theorem 5. 


example 3 | Let S be the surface shown in Figure 8.2.4, with the indicated orientation. Let F = 
yi — xj + e*’k. Evaluate [f (V x F)-dS. 


solution 
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figure 8.2.4 This surface Sis a portion of a sphere 
sitting on top of the circle x? + y? = 1. It does not 
include the disc x? + y? < 1 in the xy plane. 


This surface could be parametrized using spherical coordinates based at the center of 
the sphere. However, we need not explicitly find ® in order to solve this problem. 
By Theorem 6, ffs (V x F)-dS = f F-ds, and so if we parametrize aS by x(t) = 
cost, y(t) = sint, 0 < t < 27, we determine 


2x dx dy 2n = J 2n 
fras] (VE) a= f (— sint — cos d=- f dt = —27x 


and therefore ffs (V x F)-dS=-—27. A 


The Curl as Circulation per Unit Area 


L et us now use Stokes’ theorem to justify the physical interpretation of V x F in terms 
of paddle wheels that was proposed in Chapter 4. Paraphrasing Theorem 6, we have 


[ftcutty-nas = [[(cun'F)-ds= f F-ds= f Fras, 


where Fx is the tangential component of F. This says that the integral of the normal 
component of the curl of a vector field over an oriented surface S is equal to the line 
integral of F along aS, which in turn is equal to the path integral of the tangential 
component of F over aS. 

Suppose V represents the velocity vector field of a fluid. Consider a point P anda 
unit vector n. Let S, denote the disc of radius o and center P, which is perpendicular to 
n. By Stokes’ theorem, 


1 culw-ds= |f curl Ven ds = f V-ds, 
Sp Sp 3p 


where S, has the orientation induced by n (see Figure 8.2.5). 
By the mean-value theorem for integrals (Exercise 16, Section 7.6), there is a point 
Q in S, such that 


JI, curl V -ndS = [curl V(Q) -n]A(S,), 


where A(S,) = zp? is the area of S, and curl V(Q) is the value of curl V at Q. Thus, 


limit ——— ae “ds = limit z757 sy If (curl V)- 
p>0 A(S 


= aia vio} -n = curl V(P)-n 
p> 
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figure 8.2.6 The intuitive 
meaning of the possible signs of 


JN -ds. 


figure 8.2.5 A normal n induces an orientation on 
the boundary aS, of the disc S,. 


as, 
M v 
v 
C C C 
[N is > 0 [V as< o0 [.V-as=0 
Thus,! 
curl V(P)-n = imie- V-d (5) 
al p>0 A(S,) Sp Gi 


L et us pauseto consider the physical meaning of {. V -dswhen V isthe velocity field 
of a fluid. Suppose, for example, that V points in the direction tangent to the oriented 
curve C (Figure 8.2.6). Then clearly fẹ V -ds > 0, and particles on C tend to rotate 
counterclockwise. If V is pointing in the opposite direction, then fe V -ds < 0 and 
particles tend to rotate clockwise. If V is perpendicular to C , then particles don’t rotate 
on C atall and fẹ V -ds = 0. In general, fẹ V -ds, being the integral of the tangential 
component of V , represents the net amount of turning of the fluid in a counterclockwise 
direction around C . We therefore refer to fe V -ds as the circulation of V around C 
(see Figure 8.2.7). 

These results allow us to see just what curl V means for the motion of a fluid. The 
circulation f V-dsis the net velocity of the fluid around 0S,, so that (curl V)-n 
represents the turning or rotating effect of the fluid around the axis n. 


Circulation and Curl The dot product of curl V(P) with a unit vector n, 
namely, curl V(P) - n, equals the circulation of V per unit area at P on a surface 
perpendicular to n. 


1Some informal texts adopt equation (5) as the definition of the curl, and use it to “prove” Stokes’ 
theorem. However, this raises the danger of circular reasoning, for to show that equation (5) really 
defines a vector “curl V(P)” requires Stokes’ theorem, or some similar argument. 


figure 8.2.7 Circulation of a 
vector field (velocity field of a 
fluid): (a) Circulation about Cis 
zero; (b) nonzero circulation 
about C (“whirlpool”). 


example 4 


solution 
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Motion 
of fluid 
y particles 


Motion 
of fluid 
particles 


\ 


(a) (b) 


Observe that the magnitude of curl V(P) -n is maximized when n = curl V /|icurl V || 
(evaluated at P). Therefore, the rotating effect at P is greatest about the axis that is 
parallel to curl V/||curl V |j. Thus, curl V is aptly called the vorticity vector. 

We can use these ideas to compute the curl in cylindrical coordinates. 


Let the unit vectors @, &, @, associated with cylindrical coordinates be as shown in 
Figure 8.2.8. Let F = F,e +F,e, + Fe. (The subscripts here denote components of 
F, not partial derivatives.) Find a formula for the ẹ component of V x F in cylindrical 
coordinates. 


figure 8.2.8 Orthonormal vectors €r, @g, and ez 
associated with cylindrical coordinates. The vector e, 
is parallel to the line labeled r. 


<y 


Let S be the surface shown in Figure 8.2.9. 


“A 


figure 8.2.9 A surface element in cylindrical 
coordinates. 
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figure 8.2.10 Orthonormal 
vectors €p, €g, aNd € 
associated with spherical 
coordinotes. 


Theareaof S isr d8 dz and the unit normal is e.The integral of F around the edges 
of S is approximately 
[Fo(r, 0,z) — Felr,0,z+dz)]r dð +[F:(r, 0 +d6,z) — F,(r, 0, z)]dz 
OF o 


F 
ww o dorda+ "2 dode. 
ðZ 00 


Thus, the circulation per unit area is this expression divided by r dê dz, namely, 
10F, OF» 


ra az” 
According to the previous box, this must be the ẹ component of the curl. A 


Gradient, Divergence, and Curl in Cylindrical 
and Spherical Coordinates 


By similar arguments to Example 4, we find that the curl in cylindrical coordinates is 
given by 


e re & 

VxF ae 
ar 00 az 
Fy Fy Fz 


Wecan find other important vector quantities expressed in different coordinate systems. 
For example, the chain rule shows that the gradient in cylindrical coordinates is 
of 1 of of 


Vf = ; 
retrata? 


and in Section 8.4 we will establish related techniques that give the following formula 
for the divergence in cylindrical coordinates: 


1fa Fy 9 
V-F=_|—(rF le Ss a | =| 
r ar | r) + 30 Fa 2) 


Corresponding formulas for gradient, divergence, and curl in spherical coordinates are 


deler ug 
a’ noo sing 90” 
13,5 S 1 Fo 
N E ees F ee 
var a ap? Fel + Tong apne ol + Tone a0 
and 
1 a,, 1 Fy 
a | eee we Fj) = ee 
ER E 0) aaa |® 
1 a, 1a 1a 1 oF 
- F F 2 ; 
sana 00 p" ale [agt ol p og ü 


where e,, €, & are as shown in Figure 8.2.10 and where F = F e, + F g€ + Fo@. 


example 5 


IAI LEG 


figure 8.2.12 
The falling cat 
rights itself by 
wriggling its 
body parts. 


solution 


8.2 Stokes’ Theorem 449 


Faraday's Law 
Vector calculus plays an essential role in the theory of electromagnetism. The next 
example shows how Stokes’ theorem applies. 


Let E and H be time-dependent electric and magnetic fields, respectively, in space. L et 
S be a surface with boundary C . We define 


| E -ds = voltage around C, 

Ç 

J H -dS = magnetic flux across S. 
S 


Faraday’s law (see Figure 8.2.11) states that the voltage around C equals the negative 
rate of change of magnetic flux through S. Show that Faraday’s law follows from the 
following differential equation (one of the M axwell equations): 


vero". 
at 


=E figure 8.2.11 Faraday's law. 


— 
=H 
mei 


Assume that —dH /at = V x E holds. By Stokes’ theorem, 


[e-ds= [ftv xe) as. 


Assuming that we can move 3/ət under the integral sign, we get 


-2 ffu as= [[-0s= [[ivxe)-ds= [E-ds 
fE-as=- ffHas, 


which is Faraday's law. A 


and so 


Falling Cats and Stokes’ Theorem 


Have you ever wondered how a falling cat can right itself? Released from a resting 
position with its feet above its head, the cat is able to execute a 180° reorientation and 
land safely on its feet. This well-known phenomenon has fascinated people for many 
years— especially in cities like New York, where cats have been known to survive falls 
of 8 to 30 stories! 

There have been many incorrect explanations as to how cats are able to right them- 
selves, including the idea that it has to do with how the cat twirls its tail. This cannot 
be right because M anx cats, which have no tails, can also perform this feat! 

We observe, as in Figure 8.2.12, that the cat achieves this net change in orientation by 
wriggling, to create changes in its internal shape or configuration. On the surface, this 
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provides a seeming contradiction; because the cat is dropped from a resting position, 
it has zero angular momentum at the beginning of the fall and hence, according to 
a basic law of physics called conservation of angular momentum, the cat has zero 
angular momentum throughout the duration of its fall.? A mazingly, the cat has effectively 
changed its angular position while maintaining zero angular momentum! 

The exact process by which this occurs is subtle; intuitive reasoning can lead one 
astray and, as we have indicated, many false explanations have been offered throughout 
the history of trying to solve this mystery.? Recently, new and interesting insights have 
been discovered using geometric methods that, in fact, are related to curvature (see 
Section 7.7).* 

The way that curvature and geometry are related to the falling cat phenomenon is 
not easy to explain in full detail, but we can explain a similar phenomenon that is easy 
to understand. Interestingly, Stokes’ theorem is the key to understanding these types of 
phenomena. 


exercises 
1. Let S be the portion of the plane 2x + 3y +z = 5 lying 2. Let S be the portion of the surface z = x2 + y? lying 
between the points (—1, 1, 4), (2, 1, —2), (2, 3, —8), and between the points (0, 0, 0), (2, 0, 4), (0, 2, 4), and 
(—1, 3, 2). Find parameterizations for both the surface S (2, 2, 8). Find parameterizations for both the surface S 
and its boundary aS. Be sure that their respective and its boundary aS. Be sure that their respective 
orientations are compatible with Stokes’ theorem. orientations are compatible with Stokes’ theorem. 


In Exercises 3 to 6, verify Stokes’ theorem for the given surface S and boundary aS, and vector fields F. 


3. 


S = {(x, y, z) : x? +y? +z =1, z>0)} 6. S asin (3), and F = z?i + xj + y2k 
(oriented as a graph) 


a = {(x, y) : xX? +y? =1} 
F = xi + yj + zk 


7. LetC bethe closed, piecewise smooth curve formed by 
traveling in straight lines between the points (0, 0, 0), 
(2, 0, 4), (3, 2, 6), (1, 2, 2), and back to the origin, in 


4. Sasin (1), and F = yi + zj + xk that order. (Thus the surface S lying interior to C is 
= te! 2 <2 contained in the plane z = 2x.) Use Stokes’ theorem to 
2 SA ett ln 2a evaluate the integral: 
(oriented as a graph) 
a = {(x,y):x2+y2=]} 
F = zi + xj + (22x + 2xy)k fe cos x) dx + (x?yz) dy + (yz) dz 
c 


We saw an instance of the law of conservation of angular momentum in Section 4.1, Exercise 26. 


3A nother favorite fallacious argument, showing that a cat cannot turn itself over(!), is this: “A ccept from 
physics that angular momentum is the moment of inertia times angular velocity [moments of inertia 
are discussed in Section 6.3]. But the angular momentum of the cat is zero, so the angular velocity 
must also be zero. Because angular velocity is the rate of change of the angular position, the angular 
position is constant. Thus, the cat cannot turn itself over.” What is wrong? This argument ignores the 
fact that the cat changes its shape, and hence its moment of inertia, during the fall. 


4SeeT.R.KaneandM. Scher, “A Dynamical Explanation of the Falling Cat Phenomenon,” Int. J. Solids 
Struct., 5 (1969): 663-670. Seealso R. Montgomery, “Isoholonomic Problems and SomeA pplications,” 
Commun. Math. Phys., 128 (1990): 565-592; R. Montgomery, “How Much Doesa Rigid Body Rotate? 
A Berry's Phase from the 18" Century,” Am. J. Phys., 59 (1991b): 394-398. See also J. E. Marsden 
and J. Ostrowski, “Symmetries in M otion: Geometric Foundations of M otion Control,” Nonlinear Sci- 
ence Today (1998), http://link.springer-ny.com; R. Batterman, “Falling Cats, Parallel Parking, and Po- 
larized Light,” Philos. Soc. Arch. (2002); http://philsci-archive.pitt.edu/documents/disk0/00/00/05/83, 
http://www.its.caltech.edu/~mleok/falling_cats.htm, and references therein. 
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12. 


13. 


14. 


15. 


» LetC bethe closed, piecewise smooth curve formed by 


traveling in straight lines between the points (0, 0, 0), 
(2, 1,5), (1, 1, 3), and back to the origin, in that order. 
Use Stokes’ theorem to evaluate the integral: 


J OYA d+ boy) ay + (0 dz 
Cc 


. Redo Exercise 9 of Section 7.6 using Stokes’ theorem. 


Redo Exercise 10 of Section 7.6 using Stokes’ theorem. 


. Verify Stokes’ theorem for the upper hemisphere 


z= 4/1 — x? — y2, z > 0, and the radial vector field 
F(x, y,z) =xi+ yj + zk. 


Let S be a surface with boundary 0S, and suppose E is 
an electric field that is perpendicular to aS. Show that 
the induced magnetic flux across S is constant in time. 
(HINT: Use Faraday’s law.) 


Let S be the capped cylindrical surface shown in 
Figure 8.2.13. S is the union of two surfaces, Sı and S2, 
where S1 is the set of (x, y, z) with x? + y? = 1, 

0 <z <1, and Sp is the set of (x, y, z) with 

x2 +y? +(z-— 1)? =1,2> 1. Set F(x, y, z) = 

(zx +22y + x)i + (z3yx + y)j + 24x2k. Compute 
Shy x F) - dS. (HINT: Stokes’ theorem holds for this 
surface.) 


16. 


17. 


18. 


19. 


20. 
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x2 +y? +z? = land x + y +z > 1, and where 
F =r x (it+j+k),r=xi+yj+zk. 


Show that the calculation in Exercise 15 can be 
simplified by observing that f, F -dr = f. F -dr for 
any other surface £. By picking © appropriately, 

Jf CV x F) -dS may be easy to compute. Show that this 
is the case if © is taken to be the portion of the plane 

X +y +Z=1 inside the circle aS. 


Calculate the surface integral ff (V x F)-dS, where S 
is the hemisphere x?+y?+z? = 1, x > 0, and 
F = x3j— y3j. 


Find ff CV x F) -+ dS, where S is the ellipsoid 


x2 + y? + 22? = 10 and F is the vector field 
F = (sinxy)i+ e*j — yzk. 


Let F = yi — xj + zx3yk. Evaluate 
JstY x F)-ndA, where S is the surface defined by 
x? + y? 4+ 22=1,2<0. 


A hot-air balloon has the truncated spherical shape 
shown in Figure 8.2.14. The hot gases escape through 
the porous envelope with a velocity vector field 


V(x, y, Zz) = V x ®(X, y, 2) 
where ®(x, y,z) =—yi+ xj. 


If R = 5, compute the volume flow rate of the gases 
through the surface. 


figure 8.2.13 The capped cylinder is the 
union of Sı and &. 


Let c consist of straight lines joining (1, 0, 0), (0, 1, 0), 


and (0, 0, 1) and let S be the triangle with these vertices. 


Verify Stokes’ theorem directly with 
F = yzi + xzj + xyk. 


Evaluate the integral ff (V x F) -dS, where S is the 
portion of the surface of a sphere defined by 


21. 
22. 


figure 8.2.14 A hot-air balloon. 


Prove that Faraday’s law implies V x E = —dH /ot. 


Let S be a surface and let F be perpendicular to the 
tangent to the boundary of S. Show that 


J| xm as=0. 
S 


W hat does this mean physically if F is an electric field? 
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23. 


24, 


25. 


26. 


27. 


28. 


29. 


30. 
31. 
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Consider two surfaces S1, S2 with the same boundary 
aS. Describe with sketches how Sı and S must be 
oriented to ensure that 


J| v xmas= f (V x F)-dS. 
Si S2 


For a surface S and a fixed vector v, prove that 


2[[vends= f ivxn-ds 
S as 


where r(x, y, z) = (x, y, 2). 


Argue informally that if S is a closed surface, then 


[fv xm as=0 
S 


(see Exercise 23). (A closed surface is one that forms 
the boundary of a region in space; thus, for example, a 
sphere is a closed surface.) 


If C isa closed curve that is the boundary of a surface S, 
and f and g are C ? functions, show that 


© f tva-as= ftvt x voras 
c 
S 


(b) Jiva +ovt)-ds=0 
c 


(a) If C isa closed curve that is the boundary of a 
surface S, and v is a constant vector, show that 


[vaso 
Ç 


(b) Show that this is true even if C is not the boundary 
of a surface S. 


Show that 6: D > R3, D = [0, z] x [0, 27], 
(p, 6) = (cos sin o, sin@ sing, cos), which 
parametrizes the unit sphere, takes the boundary of 
D to half of a great circle on S. 


Verify Theorem 6 for the helicoid 
(r, 8) =(rcosé,r sing, 4), (r, 0) € [0, 1] x [0, 7/2], 
and the vector field F(x, y, z) = (z, x, y). 


Prove Theorem 6. 
Let F = xĉi + (2xy + x)j + zk. LetC bethe circle 


x? + y? = Land S the disc x? + y? < 1 within the 
plane z = 0. 


32. 


33. 


34. 


35. 


36. 


(a) Determine the flux of F out of S. 
(b) Determine the circulation of F around C. 


(c) Find the flux of V x F. Verify Stokes’ theorem 
directly in this case. 


Let S be a surface with boundary aS, and suppose that E 
is an electric field that is perpendicular to aS. Use 
Faraday's law to show that the induced magnetic flux 
across S is constant in time. 


Integrate V x F, F = (3y, —xz, —yz2) over the portion 
of the surface 2z = x? + y? below the plane z = 2, both 
directly and by using Stokes’ theorem. 


Ampere'’s law states that if the electric current density is 
described by a vector field J and the induced magnetic 
field is H, then the circulation of H around the boundary 
C of asurface S equals the integral of J over S (i.e., the 
total current crossing S). See Figure 8.2.15. Show that 
this is implied by the steady-state M axwell equation 
VxH=J. 


Current J = flux 
of J 


p” 


figure 8.2.15 Ampére’s law. 


Faraday’s law relates the line integral of the electric field 
around aloop C to the surface integral of the rate of 
change of the magnetic field over a surface S with 
boundary C . Regarding the equation V x E = —dH /ot 
as the basic equation, Faraday’s law is a consequence of 
Stokes’ theorem, as we have seen in Example 4. 
Suppose we are given electric and magnetic fields 
in space that satisfy V x E = —dH/ot. Suppose C is 
the boundary of the M dbius band shown in Figures 7.6.3 
and 7.6.4. Because the M obius band cannot be oriented, 
Stokes’ theorem does not apply. W hat becomes of 
Faraday's law? W hat do you guess Je E -ds equals? 


(a) If in spherical coordinates, we write 
& = ai + Aj + yk, finda, £, and y. 


(b) Find similar formulas for eg and ey. 
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8.3 Conservative Fields 


We saw in Section 7.2 that for a gradient force field F = Vf, line integrals of F were 
evaluated as follows: 


fF -ds= f(c(b)) — f(e(a)). 


The value of the integral depends only on the endpoints ¢(b) and c(a) of the path. In 
other words, if we used another path with the same endpoints, we would still get the 
same answer. This leads us to say that the integral is path-independent. 

Gradient fields are important in many physical problems. For example, if V = — f 
represents a potential energy (gravitational, electrical, and so on), then F represents a 
force. Consider the example of a particle of mass m in the field of the earth; in this 
case, we take f tobeGmM /r or V = —GmM /r, whereG is the gravitational constant, 
M is the mass of the earth, and r is the distance from the center of the earth. The 
corresponding force is F = —(GmM /r?)r = —(GmM /r?)n, where n is the unit radial 
vector. Note that F fails to be defined at the pointr = 0. 


When Are Vector Fields Gradients? 


We wish to characterize those vector fields that can be written as a gradient. Our task is 
simplified considerably by Stokes’ theorem. 


Theorem 7 Conservative Fields Let F be aC?! vector field defined on R?, 
except possibly for a finite number of points. The following conditions on F are 
all equivalent: 


(i) For any oriented simple closed curve C, fe F-ds=0. 
(ii) For any two oriented simple curves C ı and C 2 that have the same endpoints, 
F-ds= / F-ds. 
Ci C2 


(iii) F is the gradient of some function f; that is, F = Vf (and if F has one or 
more exceptional points where itfailsto bedefined, f isalso undefined there). 


(iv) Vx F=0. 


A vector field satisfying one (and, hence, all) of the conditions (i)- (iv) is called 
a conservative vector field.° 


>If the minus sign is used, then V is decreasing in the direction F. 

5In the plane R?, exceptional points are not allowed (see Exercise 16). Theorem 7 can be proved in the 
same way if F is defined and is of class C ! only on an open convex set in IR? or R?. (A set D is convex 
if P,Q e D implies the line joining P and Q also belongs to D.) 
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figure 8.3.1 Constructing (a) an 
oriented simple closed curve 

©; — C, from (b) two oriented 
simple curves. 


¢(b) = €2(b) 


c=C;— Cy c 


(a) = €2(a) 


(a) (b) 


proof We shall establish the following chain of implications, which will prove the 
theorem: 


(i) > (ii) > (iii) > (iv) = (i). 


First we show that condition (i) implies condition (ii). Suppose cı and cz are para- 
metrizations representing Cı and C2, with the same endpoints. Construct the closed 
curve c obtained by first traversing cı and then —c, (Figure 8.3.1), or, symbolically, the 
curve € = C; — Cz. Assuming cis simple, condition (i) gives 


fF-ds= f F-ds— f F-ds=0, 
c C C2 


and so condition (ii) holds. (If c is not simple, an additional argument, omitted here, is 
needed.) 

Next, we provethat condition (ii) implies condition (iii). LetC be any oriented simple 
curve joining a point such as (0, 0, 0) to (x, y, z), and suppose C is represented by the 
parametrization c [if (0, 0, 0) is the exceptional point of F, we can choose a different 
starting point for c without affecting the argument]. Define f(x, y, z) to be f, F - ds. By 
hypothesis (ii), f(x, y, z) is independent of C . We shall show that F = grad f. Indeed, 
choose c to be the path shown in Figure 8.3.2, so that 


x y z 
Fix y2) = f Falt, 0,0)at+ f F(x, t, 0) d+ f F3(x, y, t) dt, 
0 0 0 


where F = (Fi, F2, F3). 

It follows from the fundamental theorem of calculus that əf /az = F3. We can repeat 
this processing using two other paths from (0, 0, 0) to (x, y, z) [for example, by drawing 
the lines from (0, 0, 0) to (0, y, 0) to (x, y, 0) to (x, y, z)], and we can similarly show 
that af /ax = Fı and of /ay = Fz (see Exercise 26). Thus, Vf = F. 

Third, condition (iii) implies condition (iv), because, as proved in Section 4.4, 


Vxvf =0. 


figure 8.3.2 A path joining (0, 0, 0) to (x, y, 2). 
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figure 8.3.3 A surface S spanning a curve C. 


Finally, let c represent a closed curve C and let S be any surface whose boundary is 
c (if F has exceptional points, choose S to avoid them). Figure 8.3.3 indicates that we 
can probably always find such a surface; however, a formal proof of this would require 
the development of more sophisticated mathematical ideas than we can present here. 
By Stokes’ theorem, 


[F-as= [F-as= [ftv xr)-nds = f ficu Finas. 


Because V x F = 0, this integral vanishes, so that condition (iv) > condition (i). ™ 


Physical Interpretations of fcF- ds 


We have already seen that one interpretation of the line integral is as the work done by F 
in moving a particle along C . A second interpretation is the notion of circulation, which 
we encountered at the end of the last section. Recall that in this case, we think of F as 
the velocity field of a fluid; that is, to each point P in space, F assigns the velocity vector 
of the fluid at P (in the last section, F was designated V). Take C to be a closed curve, 
and let As be a small directed chord of C . Then F - Asis approximately the tangential 
component of F times ||Asj|. The circulation fẹ F -ds is the net tangential component 
around C. A small paddle wheel placed in the fluid would rotate if it is centered at a 
point where F vanishes and if the circulation of the fluid is nonzero, or fẹ F -ds # 0 
for small loops C (see Figure 8.3.4). 

Thereis a similar interpretation in electromagnetic theory: If F represents an electric 
field, then a current will flow around a loop C if fe F-ds# 0. 

By Theorem 7, a field F has no circulation if and only if curl F = V x F = 0. Hence, 
a vector field F with curl F = Ois called irrotational. We have therefore proved that a 
vector field in R? is irrotational if and only if it is a gradient field for some function, 
that is, if and only if F = Vf. The function f is called a potential for F. 


figure 8.3.4 J-F- ds 4 0 implies that a paddle 
wheel in a fluid with velocity field F will rotate 
around its axis. 
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example | Consider the vector field F on R? defined by 
F(x, y,z) = yi + (zcos yz + x)j +(ycosyz)k. 
Show that F is irrotational and find a scalar potential for F. 


solution We compute V x F: 


i j k 
VxF= fa 2 KA 
OX oy OZ 

y X +ZcOS yz y COS yZ 


= (cos yz — yzsinyz — cos yz + yzsinyz)i+(0—0)j + (1 — 1)k 
= 0i + 0j + 0k = 0, 


so F isirrotational. Thus, a potential exists by Theorem 7. We can find itin several ways. 
Method 1. By the technique used to prove that condition (ii) implies condition (iii) 
in Theorem 7, we can set 


X y Z 
f(x,y,z) =i F x(t, 0, 0) d+ f F-(x, t, 0) at f F3(x, y,t) dt 
0 0 0 


X y z 
a. oat+ f xat+ f y cos yt dt 
0 0 0 


= 0 + xy + sin yz = xy + sin yz. 


We easily verify that Vf = F, as required: 


əf. əf. af . A 
yf = x t gy! t gy k= Yit (*+zcosyz)j + (y cos yzk. 


Method 2. Because we know that f exists, we know that we can solve the system 
of equations 


of 


= il =X+ZcCOSYZ à = Y COS yZ 
zxr a yz, əz y yz, 


ay 7 

for f(x, y, z). These are equivalent to the simultaneous equations 
(a) f(x,y,z) = xy + ily, z) 

(b) f(x, y,z) =sinyz+xy + h2(x, z) 


(c) f(x, y,z) =sinyz +h3(x, y) 
for functions hı, h2, ha independent of x, y, and z (respectively). When hi(y,z) = 
sin yz, h2(x, z) = 0, and h3(x, y) = xy, the three equations agree and so yield a 
potential for F. However, we have only guessed at the values of hi, h2, and h3. To derive 
the formula for f more systematically, we note that because f(x, y, z) = xy +h:(y, Zz) 
and af /dz = y cos yz, we find that 


ahi y, Z) 
0z 


= y cos yz 


example 2 


solution 
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or 
hi(y,z) = J ycosyeae +g(y) =sinyz+g(y). 
Therefore, substituting this back into equation (a), we get 
f(x, y, z) = xy +sinyz+9(y); 
but by equation (b), 
g(y) = ha(x, z). 


Because the right side of this equation is a function of x and z and the left side is a 
function of y alone, we conclude that they must equal some constant C . Thus, 


f(x,y,z) =xy+sinyz+C 


and we have determined f up to a constant. A 


A mass M at the origin in R? exerts a force on a mass m located at r = (x, y, z) 
with magnitude GmM /r? and directed toward the origin. Here, G is the gravitational 
constant, which depends on the units of measurement, andr = ||r|| = \/x2 + y? +22. 
If we remember that —r/r is a unit vector toward the origin, then we can write the force 
field as 


GmMr 


F(x, y,2) =-—3 


Show that F is irrotational and find a scalar potential for F. (Notice that F is not defined 
at the origin, but Theorem 7 still applies, because it allows an exceptional point.) 


Firstlet us verify that V x F = 0. Referring to formula 10 in the table of vector identities 


in Section 4.4, we get 
1 1 


But V(1/r3) = —3r/r? (see Exercise 38, Section 4.4), and so the first term vanishes, 
because r x r = 0. The second term vanishes, because 


F = —GmM 


i j k 
Teral =. 2 - (= ris (Z-Z) (-S) kno 
Ox dy dz əy az ðZ = OX ox oy 
X y Z 


Hence, V x F = 0 (for r 40). 

If we recall the formula V(r") = nr"~@r (again, see Exercise 38, Section 4.4), then 
we can read off a scalar potential for F by inspection. We have F = —VV, where 
V(x, y,Z) = —GmM /r is called the gravitational potential energy. 

[We observe in passing that by Theorem 3 of Section 7.2, the work done by F in 
moving a particle of mass m from a point P, to a point P2 is given by 


where ry, is the radial distance of Pı from the origin, with r2 similarly defined.] A 
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example 3 


solution 


The Planar Case 


By thesame proof, Theorem 7 is also truefor C ! vector fields F on R°. In this case, we re- 
quire that F has no exceptional points; that is, F is smooth everywhere (see E xercise 16). 
Notice, however, that the conclusion might still hold even if there are exceptional points, 
an example being (xi + yj)/(x? + y?)?/2. An example where the conclusion does not 
hold is (—yi + xj) /(x? + y?), as shown in Exercise 16. 

If F = Pi+ Qj, then 


Sometimes əQ /dx — dP /dy is called the scalar curl of F. Therefore, the condition 
V x F = 0 reduces to 


aP _ Q 
əy ax’ 
Thus, we have: 


Corollary 1 F is aC? vector field on R? of the form Pi + Qj that satisfies 
aP /ay = 8Q /ax, then F = Vf for some f on R?. 


We emphasize again that this corollary can be false if F fails to be of class C * at even 
a single point (an example is given in Exercise 16). In R?, however, as already noted, 
exceptions at single points are allowed (see Theorem 7). 


(a) Determine whether the vector field 
F=e9i+ enj 

is a gradient field. 

(b) Repeat part (a) for 

F = (2x cos y)i — (x? sin y)j. 
(a) Here P (x, y) = e* and Q(x, y) = e*tY, and so we compute 
= xe”, a = 0, 
ax 

These are not equal, and so F cannot have a potential function. 

(b) In this case, we find 
Ts 2x sin y = x 
ay Y= ox" 
and so F has a potential function f. To compute f we solve the equations 


a = oye T agen 
ax y: ay y- 


Thus, f(x, y) =x? cosy +hı(y)and f(x, y) = x? cos y +h2(x). If hı and hz are the 
same constant, then both equations are satisfied, and so f (x, y) = x? cos y is a potential 
forF. A 
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example 4 | Letc: [1,2] — R? be given by x =e}, y = sin(z/t). Compute the integral 
JF-ds= [2xcosy dx —xsiny dy, 
c c 


where F = (2x cos y)i — (x? sin y)j. 


solution | The endpoints are c(1) = (1,0) and ¢(2) = (e, 1). Because (2x cosy)/ay = 
d(—x? sin y)/əx, F is irrotational and hence a gradient vector field (as we saw in Ex- 
ample 3). Thus, by Theorem 7, we can replace c by any piecewise C ! curve having the 
same endpoints, in particular, by the polygonal path from (1, 0) to (e, 0) to (e, 1). Thus, 
the line integral must be equal to 


e 4 
[F-s= f 2tcoso dt+ f — e? sint dt = (e? — 1) + e?(cos 1 — 1) 
c 1 0 


= e’cosl—1. 
Alternatively, using Theorem 3 of Section 7.2, we have 
fa cos y dx — x? sin y dy = fxi -ds= f(c(2)) — f(e(1)) =e*cos1 —1, 
c c 


because f(x, y) = x? cos y is a potential function for F. Evidently, this technique is 
simpler than computing the integral directly. A 


We conclude this section with a theorem that is quite similar in spirit to Theorem 7. 
Theorem 7 was motivated partly as a converse to the result that curl Vf = 0 for any C? 
function f: R? —> R— or, if curl F = 0, then F = Vf. We also know [formula (9) in 
the table of vector identities in Section 4.4] that div(curl G) = 0 for any C ? vector field 
G. We can ask about the converse statement: If div F = 0, is F the curl of a vector field 
G? The following theorem answers this in the affirmative. 


Theorem 8 If F isaC? vector field on all of R? with div F = 0, then there 
exists a C 1 vector field G with F = curl G. 


The proof is outlined in Exercise 20. We should warn you at this point that, unlike the 
F in Theorem 7, the vector field F in Theorem 8 is not allowed to have an exceptional 
point. For example, the gravitational force field F = —(GmM r/r?) has the property 
that div F = 0, and yet there is no G for which F = curl G (see Exercise 29). Theorem 
8 does not apply, because the gravitational force field F is not defined at 0 € R?. 


exercises 
1. Determine which of the following vector fields F in the 2. Repeat Exercise 1 for the following vector fields: 
plane is the gradient of a scalar function f. If such an f i F 20 F 
exists, find it. (a) F(x, y) = (cosxy — xy sinxy)i — (x? sinxy)j 


E E T (b) F(x, y) = (Xy/x2y? + Di +(yyx?y? + Dj 


= i—(x2si ivii 
(b) F(x, y) = xyi + xyj (c) F(x, y) = (2x cos y + cos y)i — (x4 sin y +x sin y)j 


(c) F(x, y) = (x? + y?)i+ 2xyj 
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3. 


10. 


11. 


12. 
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For each of the following vector fields F, determine (i) if 
there exists a function g such that Vg = F, and (ii) if 
there exists a vector field G such that curl G = F. (Itis 
not necessary to find g or G.) 
(a) F(x, y, z) = (4xz — x, —4yz, z — 2y) 
(b) F(x, y, z) = (e% sin y, e* cos y, z?) 
(c) F(x, y, 2) = (log(z? +1) + yê, 2xy, #5 
(d) F(x, y, z) = (x? + x sinz, y cosz — 2xy, 

cosz + sinz) 


.« For each of the following vector fields F, determine (i) if 


there exists a function g such that Vg = F, and (ii) if 
there exists a vector field G such that curl G = F. (Itis 
not necessary to find g or G.) 


(a) F(x, y, z) = (e¥ cosy, —e* sin y, x) 


= y x —2xyz 
(b) F(x, y, z) = (wa 7744" e416) 


(c) F(x, y, z) = (x2y2z?, ye*, xy cosz) 
(d) F(x, y, z) = (6z°y>, 9x8z?, 4x3y3) 


» Show that any two potential functions for a vector field 


on R? differ at most by a constant. 


. (a) LetF(x, y) = (xy, y?) and let c be the path y = 2x? 


joining (0, 0) to (1, 2) in R?. Evaluate f. F- ds. 


(b) Does the integral in part (a) depend on the path 
joining (0, 0) to (1, 2)? 


. Let F(x, y, z) = (2xyz + sin x)i + x2zj + x2yk. Finda 


function f such that F = Vf. 


. Evaluate f F- ds, where c(t) = (cost, sin? t, t4), 


0 <t < z, and F is asin Exercise 7. 


» If f(x) is a smooth function of one variable, must 


F(x, y) = f(x)i4+ f(y)j bea gradient? 
(a) Show that F = —r/|r||? is the gradient of 
f(x, y,z) =1/r. 


(b) What is the work done by the force F = —r/|r||> in 
moving a particle from a point ro € R? “to oo,” 
where r(x, y, z) = (x, y, z)? 


Let F(x, y, z) = xyi + yj + zk. Can there exist a 
function f such that F = Vf? 


Let F = F1i + F2j + F3k and suppose each Fx satisfies 
the homogeneity condition 

k= 12,2: 
Suppose also V x F = 0. Prove that F = Vf, where 


Fx(tx, ty, tz) = tFk(x, y, Z), 


2f(x, y, z) =xF1(x, y, z)+yF2(x, y, z)+zF3(x, y, 2). 


[HINT: Use Review Exercise 31, Chapter 2.] 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


Let F(x, y, z) = (e* sin y)i + (e¥ cos y)j + 22k. 
Evaluate the integral he F - ds, where 
c(t) = (VE, t3, exp st), 0 <t <1. 


Leta fluid have the velocity field F(x, y, z) = 
xyi + yzj + xzk. W hatis the circulation around the unit 
circle in the xy plane? Interpret your answer. 


The mass of the earth is approximately 6 x 1027 g and 
that of the sun is 330,000 times as much. The 
gravitational constant is 6.7 x 1078 cm3/s? - g. The 
distance of the earth from the sun is about 1.5 x 10!2 cm. 
Compute, approximately, the work necessary to increase 
the distance of the earth from the sun by 1 cm. 


(a) Show that fe (x dy — y dx)/(x? + y?) = 2x, where 
C isthe unit circle. 


(b) Conclude that the associated vector field 
[—y/(x? + y?)li + [x/(x? + y2)]j is nota 
conservative field. 


(c) Show, however, that aP /ay = aQ /ax. Does this 
contradict the corollary to Theorem 7? If not, why 
not? 


Determine if the following vector fields F are gradient 
fields. If there exists afuction f such that V f = F, 
find f. 

(a) F(x, y, z) = (2xyz, x?z, x?y) 

(b) F(x, y) = (x cosy, x sin y) 

(c) F(x, y, z) = (x?eY, xyz, ež) 

(d) F(x, y) = (2x cosy, —x? sin y) 


Determine if the following vector fields F are gradient 
fields. If there exists a function f such that V f = F, 
find f. 

(a) F(x, y) = (2x + y? — y sin x, 2xyz + cosx) 

(b) F(x, y, z) = (6x?z2, 5x2y?, 4y?z?) 

(c) F(x, y) = (y? +1, 3xy? +1) 

(d) F(x, y) = (xet? + 2xy, yel’+¥’) 4 4y3z, y4) 


Show that the following vector fields are conservative. 
Calculate fe F -ds for the given curve. 


(a) F = (xy? + 3x?y)i + (x + y)x2j;C is the curve 
consisting of line segments from (1, 1) to (0, 2) to 
(3, 0). 
2x |. 2y(x? +1). x ; , 
Pr ii (yep aye j; C is parametrized by 
x=t}-1ly=t®-t,0<t<l1. 


(c) F = [cos (xy?) — xy? sin (xy2)]i — 2x2y sin (xy?)j; 
C isthe curve (et, ett!) —1 < t <0. 


(b) F = 


20. 


21. 


22. 


23. 
24. 


25. 


26. 


Prove Theorem 8. [HINT: Define G = G1i+G2j + G3k 
by 


z y 
G(x, y,z) = i: Fo(x, y, t) a- f F3(x, t, 0) dt 
0 0 


Z 
G2(x, y, Z) = -f F(x, y, t) dt 
0 


and G3(x, y, z) = 0.] 


Is each of the following vector fields the curl of some 
other vector field? If so, find the vector field. 

(a) F = xi + yj + zk 

(b) F = (x? +1)i+ (z — 2xy)j + yk 


Let F = xzi — yzj + yk. Verify that V-F = 0. Find a G 
such that F = V x G. 


Repeat Exercise 22 for F = y?i + 22j + x?k. 


Let F = xei — (x cosz)j — zeYk. Find a G such that 
F=VxG. 


Let F = (x cos y)i — (sin y)j + (sin x)k. Find a G such 
tht F =V xG. 


By using different paths from (0, 0, 0) to (x, y, z), show 
that the function f defined in the proof of Theorem 7 for 
“condition (ii) implies condition (iii)” satisfies 

of/ax = Fy and of /ay = F2. 
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27. 


28. 


29. 
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Let F be the vector field on R? given by F = —yi + xj. 
(a) Show that F is rotational, that is, F is not 
irrotational. 


(b) Suppose F represents the velocity vector field of a 
fluid. Show that if we place a cork in this fluid, it 
will revolve in a plane parallel to the xy plane, ina 
circular trajectory about the z axis. 


(c) In what direction does the cork revolve? 


Let G be the vector field on R3\{z axis} defined by 


o = 4 : 

G = x2 y T xy) 
(a) Show that G is irrotational. 
(b) Show that the result of Exercise 27(b) holds for G 
also. 
How can we resolve the fact that the trajectories of 
F and G are both the same (circular about the z axis) 
yet F is rotational and G is not? [HINT: The property 
of being rotational is a local condition, that is, a 
property of the fluid in the neighborhood of a point.] 


~ 
EA 


Let F = —(GmM r/r?) be the gravitational force field 
defined on R?\ {0}. 


(a) Show that div F = 0. 


(b) Show that F 4 curl G for any C 1 vector field G on 
R3\ {0}. 


Gauss’ theorem states that the flux of a vector field out of a closed surface equals the 
integral of the divergence of that vector field over the volume enclosed by the surface. 
The result parallels Stokes’ theorem and Green's theorem in that it relates an integral 
over aclosed geometric object (curve or surface) to an integral over a contained region 


(surface or volume). 


Elementary Regions and Their Boundaries 


We shall begin by asking you to review the various elementary regions in space that 
were introduced when we considered the volume integral; these regions are illustrated in 
Figures 5.5.2 and 5.5.4. As these figures indicate, the boundary of an elementary region 
in R? is a surface made up of a finite number (at most six, at least two) of surfaces that 
can be described as graphs of functions from R? to R. This kind of surface is called a 


closed surface. The surfaces S4, S2, .. 


its faces. 


example | 


., Sn Composing such a closed surface are called 


The cubein Figure 8.4.1(a) is an elementary region, and in fact a symmetric elementary 


region, with six rectangles composing its boundary. The sphere in Figure 8.4.1(b) is the 
boundary of a solid ball, which is also asymmetric elementary region. 
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Outward normal 


Inward normal 


figure 8.4.2 Two possible 
orientations for a closed surface. 


example 2 


figure 8.4.1 (a) Symmetric elementary 


regions and (b) the surface 5; composing 
their boundaries. 
Si Sı 


(a) (b) A 


Closed surfaces can be oriented in two ways. The outward orientation makes the 
normal point outward into space, and the inward orientation makes the normal point 
into the bounded region (Figure 8.4.2). 

Suppose S is a closed surface oriented in one of these two ways and F is a vector 
field on S. Then, as we defined it in Section 7.6, 


[[F-as=% ff Fas. 


If S is given the outward orientation, the integral ff; F -dS measures the total flux of 
F outward across S. That is, if we think of F as the velocity field of a fluid, //,F-dS 
indicates the amount of fluid leaving the region bounded by S per unit time. If S is given 
theinward orientation, the integral ff; F -dS measures the total flux of F inward across S. 

Werecall another common way of writing these surface integrals, a way that explicitly 
specifies the orientation of S. Let the orientation of S be given by a unit normal vector 
n(x, y, z) at each point of S. Then we have the oriented integral 


[[F-as= [[te-mas, 


thatis, theintegral of the normal component of F over S. In the remainder of this section, 
if S is a closed surface enclosing a region W , we adopt the default convention that S = 
aW is given the outward orientation, with outward unit normal n(x, y, z) at each point 
(x, y, Z) € S. Furthermore, we denote the surface with the opposite (inward) orientation 
by AW op. Then the associated unit normal direction for this orientation is —n. Thus, 


[fF-9s= fE- mas =— f[tr-(-mas = ff aes 


The unit cube W given by 


0<x<l, 0<y<l, 0<z<l 


is asymmetric elementary region in space (see Figures 8.4.3 and 5.5.5). 
We write the faces as 


0, 0=x <1, 0<y<l 
1, 0<x <1, 0<y<l 
=0, 0<y<l, 0<z<l 
1; 0<y<l, 0<z<l 
0, 0<x <1, 0<z<l 
L 0<x<1, 0<z<l. 
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n; =—k 


figure 8.4.3 The outward orientation on the 
cube. 


From Figure 8.4.3, we see that 


n=k=-—-m, 
n, = i = —ns, 
ng = J = —Ns, 


and so for a continuous vector field F = Fii + Fj + F3k, 


a Feds = //F-nds=— [ Fads + f Fads — f| peas 
ow S Sı S2 S3 
+ [f tas ff trds+ ff Fras. a 
S4 S5 Se 


Gauss’ Theorem 


We have now come to the last of the three central theorems of this chapter. T his theorem 
relates surface integrals to volume integrals; in other words, the theorem states that if W 
is a region in R3, then the flux of a vector field F outward across the closed surface dW 
is equal to the integral of div F over W. We begin by assuming that W is asymmetric 
elementary region (Figure 5.5.5). 


Theorem 9 Gauss’ Divergence Theorem LetW beasymmetric elemen- 
tary region in space. Denote by aW the oriented closed surface that bounds W. 
Let F be a smooth vector field defined on W. Then 


[[ [ira = |/ Fas 
Jff a E as 


or, alternatively, 
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proof If F = Pi+ Qj-+ Rk, then by definition, the divergence of F is given by 
div F = oP /ax + aQ /dy + dR/dz, so we can write (using additivity of the volume 
integral) 


[ff ara = [ff Sav + Sff Pav+ Iz dR av. 
W Ww 
On the other hand, the surface integral in question is 
Ji Fonds = ff (Pi+ Qj +Rk)-nds 
aw aw 
= Jf Pinds + [/ Qj-nas + f| Rk-ndS. 
aw aw oW 


The theorem will follow if we establish the three equalities 


JI eeso fha (1) 
JE orms = ff 4 o 


ff Re-nas = fff Sav. (3) 


We shall prove equation (3); the other two equalities can be proved in an analogous 
fashion. 
Because W is asymmetric elementary region, there is a pair of functions 


and 


Z=91(x, y), Z = 92(x, y), 


with common domain an elementary region D in the xy plane, such that W is the set of 
all points (x, y, z) satisfying 


gi(x,y) <z < 92(x, y), (x, y)eD 


By reduction to iterated integrals, we have 


MESSE Fes) oxay 


and so, by the fundamental theorem of calculus, 


Wen a2 = fir Y, 92(X, y)) — R(X, y, galx, y))]dxdy. (4) 


The boundary of W is a closed surface whose top S2 is the graph of z = g2(Xx, y), 
where (x, y) € D, and whose bottom S; is the graph of z = gi(x, y), (x, y) € D. The 
four other sides of aW consist of surfaces S3, S4, Ss, and Sg, whose normals are always 
perpendicular to the z axis. (See Figure 8.4.4. Note that some of the other four sides 
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S4 


z = Bo% y) 


figure 8.4.4 A symmetric elementary region 
Wfor which ff Rk- as= fff (aR/az) av 
The four sides of dW, which are S3, Sa, 55, S% 
have normals perpendicular to the z axis. 


z= gi y) 
Sy 


might be absent; for instance, if W is a solid ball and aW is a sphere.) By definition, 


6 
J| rkenas = ff rk-mas+ ff Rk-mds+5> f| rk-mads. 
aw Sı s a oes 


Because the normal n; is perpendicular to k on each of $3, S4, Ss, Sg, we have k - n = 0 
along these faces, and so the integral reduces to 


// Rk-nds= f Rk-dSı+ f Rk- dS». (5) 
aw Sı S2 


The surface S; is defined by z = gi(x, y), and 


_ (2%; 2; 
asi = (2i Sy - k) ox6y 


(the negative of the general formula for dS for graphs from Section 7.6, because the 
normal is downward pointing). Therefore, 


| Rk-d8, = — | R(x, y, ga(x, y)) dxdy. (6) 
Sı D 


Similarly, for the top face S», 


Therefore, 
II Rk-dS2— f R(x, yY, ga(x, y)) dxdy. (7) 
2 D 


Substituting equations (6) and (7) into equation (5) and then comparing with equation (4), 


we obtain 
lls oh ly -Jf R(k-n) dS 


The remaining equalities, (1) and (2), can be established in the same way to complete 
the proof. E 
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example 3 


solution 


example 4 


figure 8.4.5 A more general region to which Gauss’ 
theorem applies. 


Generalizing Gauss’ Theorem 


The reader should note that the proof of Gauss’ theorem is similar to that of Green's 
theorem. By the procedure used in Exercise 16 of Section 8.1, we can extend Gauss’ 
theorem to any region that can be broken up into symmetric elementary regions. This 
includes all regions of interest to us. An example of a region to which Gauss’ theo- 
rem applies is the region between two closed surfaces, one inside the other. The sur- 
face of this region consists of two pieces oriented as shown in Figure 8.4.5. We shall 
apply the divergence theorem to such a region when we prove Gauss’ law in Theo- 
rem 10. 


Consider F = 2xi + y*j + z?k. Let S be the unit sphere defined by x? + y? +z? = 1. 
Evaluate ff, F-n dS. 


By Gauss’ theorem, 


[feenas= fff (ivr, 


where W is the ball bounded by the sphere. The integral on the right is 


2 Jff a+y+aa =2 ff] dV +2 fff yav +2 fff 2av. 


By symmetry, we can argue that [ffy ydV = fff zdV = 0 (see Exercise 21, Section 
6.3). Thus, because a sphere of radius R has volume 4x R?/3, 


2 fff a+y+aa =2 fff dy = 


You can convince yourself that direct computation of //,F-ndS proves 
unwieldy. A 


Use the divergence theorem to evaluate 


J| e +y+aas, 


where W is the solid ball x? + y? +z? < 1. 


solution 
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To apply Gauss’ divergence theorem, we find a vector field F = Fii + F2j + Fakon W 
with F -n = x? + y + z. Atany point (x, y, z) € dW, the outward unit normal n to aW 
is 

n= xi + yj +zk, 


because on WW, x? + y? + z? = 1 and the radius vector r = xi + yj + zk is normal to 
the sphere aW (Figure 8.4.6). 


n = xi + yj + zk 


figure 8.4.6 n is the unit normal to aW, the 
boundary of the ball W. 


Therefore, if F is the desired vector field, then 
F-n= Fix + Foy + Faz. 


We set 


Fix=x’?, Foy=y,  F3z=z 


and solve for Fı, F2, and F3 to find that F = xi +j + k. Computing div F, we get 
div F =1+04+0=1. 


Thus, by Gauss’ divergence theorem, 


Jf very+nds= fff dv = volume (W) = $x. á 


The Divergence as the Flux per Unit Volume 


The physical meaning of divergenceis that at a point P, div F (P) is the rate of net outward 
flux atP per unitvolume. This followsfrom Gauss’ theorem and the mean-value theorem 
for integrals: If W, is a ball in R? of radius p centered at P, then there is apointQ € W, 


such that 
J F-nds= [f div F dV = div F(Q) : volume (W,) 
MW, W, 


and so 


E in , 1 
div F(P) = limitdiv A E Jh Fons 
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example 5 


solution 


This is analogous to the limit formulation of the curl given at the end of Section 8.2. 
Thus, if div F(P) > 0, we consider P to bea source, for there is a net outward flow near 
P. If div F(P) < 0, P is called a sink for F. 

A Ct vector field F defined on R? is said to be divergence-free if div F = 0. If F is 
divergence-free, we have [fs F -dS = 0 for all closed surfaces S. The converse can also 
be demonstrated readily using Gauss’ theorem: If ff F -dS = 0 for all closed surfaces 
S, then F is divergence-free. If F is divergence-free, we thus see that the flux of F across 
any closed surface S is 0, so that if F is the velocity field of a fluid, the net amount of 
fluid that flows out of any region will be 0. Thus, exactly as much fluid must flow into 
the region as flows out (in unit time). A fluid with this property is therefore described 
as incompressible. 


Evaluate ff; F -dS, where F(x, y, z) = xy’i + x’yj + yk and S is the surface of the 
cylinder x? + y? = 1, bounded by the planes z = 1 and z = —1, and including the 
portions x? + y? < 1 when z = +1. 


We can compute this integral directly, but it is easier to use the divergence theorem. 
Now S is the boundary of the region W given by x? + y? < 1, —1 < z < 1. Thus, 
J{,F-dS = fffy (div F) dV. Moreover, 


Jff F)dV = [ff + y?) dxdydz = f. (H +y?) ax ay) dz 
= 2 ff, 0 + y?) dxdy. 


Before evaluating the double integral, we note that the surface integral satisfies 


II Fonds =2 [| (x? + y?) dxdy > 0. 
aw x?+y?<1 


This means that ffy F - dS, the net flux of F out of the cylinder, is positive. 
We change variables to polar coordinates to evaluate the double integral: 


X =r cosé, y =r sin9, 0<r<1, 0<0<2x. 


Hence, we have a(x, y)/a(r, 0) =r and x? + y? = r?. Thus, 


27 1 
II (x? + y?) axdy = | (J rar) team, 
x24y2<l 0 0 2 


Therefore, [ffy div FdV =m. A 


Gauss’ Law 


As we remarked earlier, Gauss’ divergence theorem can be applied to regions in space 
more general than symmetric elementary regions. To conclude this section, we shall use 
this observation in the proof of the following important results. 
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Theorem 10 Gauss’ Law Let M bea symmetric elementary region in R3. 
Then if (0,0, 0) g æM , we have 


r- Da 4 if (0,0,0) eM 
ioe if (0,0,0) g M, 


r(x, y,Z) =xi+ yj +zk 


r(x, Y, 2) = (r(x, y, Z) | = yX? + yY? +22. 


proof of Gauss’ law First suppose (0, 0,0) g M.Thenr/r?isaC? vector field 
on M and æM , and so by the divergence theorem, 


eee 


But V-(r/r?) = 0 forr + 0, as you can easily verify (see Exercise 38, Section 4.4). 
Thus, 


Now let us suppose (0, 0,0) € M. We can no longer use the preceding method, 
because r/r? isnot smooth on M , due to the zero denominator atr = (0, 0, 0). Because 
(0,0,0) € M and (0, 0,0) ¢ æM , there is an e > 0 such that the ball N of radius € 
centered at (0, 0, 0) is contained completely inside M . Let W be the region between M 
and N. Then W has boundary dN U aM = S. But the orientation on aN induced by the 
outward normal on W is the opposite of that obtained from N (see Figure 8.4.7). 

Now V- (r/r?) = 0 on W, and so, by the divergence theorem applied to the (nonele- 


mentary) region W, 
r 
[[as- fff v- (S)av =0. 


figure 8.4.7 Induced outward orientation on 
S; Wis M minus the ball N. 
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example 6 


Because 


[fes- ff cras+ ff = MGs, 
S a T 
where n is the outward normal to S, we have 
r-n r-n 
J, p3 e JI, re = 
However, on aN , n= —r/r andr =e, because ON is asphere of radius £, so that 


-Jf ‘Mase ff Sas=3 ff as. 


But ff, dS = 4zre?, the surface area of the sphere of radius e. This proves the 
result. i 


Gauss’ law has the following physical interpretation. The potential due to a point charge 
Q at (0, 0, 0) is given by 


Q Q 
Ant Any/xX2 + y2 +722" 


and the corresponding electric field is 


P(X, y, z) = 


Thus, Theorem 10 states that the total electric flux ff, E -dS (i.e, the flux of E out 
of a closed surface aM ) equals Q if the charge lies inside M and zero otherwise. Note 
that even if (0, 0,0) g M , E will still be nonzero on M . 

For a continuous charge distribution described by a charge density p in a region W, 
the field E is related to the density by 


divE=V-E =p. 


fyes- [ffen =o 


that is, the flux out of a surface is equal to the total charge inside. A 


Thus, by Gauss’ theorem, 


Divergence in Spherical Coordinates 
We next use Gauss’ theorem to derive the formula 


div = Š fi kis 


—— — (8) 
2 do psing ad p sing 30 


for the divergence of a vector field F in spherical coordinates, which was stated in 
Section 8.2. (Again, the subscripts here denote components, not partial derivatives.) 
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p sin @ d0 


figure 8.4.8 Infinitesimal volume 
determined by dp, dé, dọ at (p, 0, $). 


x Eoin 6 d0 


The method is to use the formula 


, DER 1 
div F(P) = jimit yy / Fonas: (9) 


where W is a region with volume V (W ), which shrinks down to a point P (in the main 
text we use a ball, but we can use regions of any shape). Let W be the shaded region in 
Figure 8.4.8. 

For the two faces orthogonal to the radial direction, the surface integral in equation (9) 
is, approximately, 


F (o+dp, ¢, 0) x (area of outer face) — F (o, $, 8) x (area of inner face) 
~ F lo +dp, 6, O)(p +dp)’singddde — F lp, o, 0)’ sinodo do 
w DIF paj 
x gp"? sing) dodo dê (10) 


by the one-variable mean-value theorem. Dividing by the volume of the region W, 
namely, p? sin @ do dọ dé, weseethatthe contribution to the right-hand side of equation 
(9) is 
tO: a 
——(p*F 11 
aap rel (11) 


for these faces. Likewise, the contribution from the faces orthogonal to the ¢ direction 
is 

1 Fo 
psing 30` 


O pes EEE, 
angaa pF ¢), and for the 6 direction, 
Substituting (11) and these expressions in equation (9) and taking the limit gives 
equation (8). 


Maxwell's Equations and the Prediction of Radio 
Waves: The Communication Revolution Begins 


We now discuss M axwell’s equations, which govern the propagation of electromagnetic 
fields. The form of these equations depends on the physical units oneis employing, and 
changing units introduces factors like 4x and the velocity of light. We shall choose the 
system in which M axwell’s equations are simplest. 
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Let E and H be C! functions of (t, x, y, z) that are vector fields for each t. They 
satisfy (by definition) M axwell’s equation with charge density p(t, x, y, z) and current 
density J (t, x, y, z) when the following conditions hold: 


V.E = p (Gauss’ law), (12) 
V -H = 0 (no negative sources), (13) 
VxE+ a = 0(Faraday’s law), (14) 
and 
dE ; 
VxH— ao J] (Ampére's law). (15) 


Of these laws, equations (4) and (6) were described in integral form in Sections 8.2 and 
8.4; historically, they arose in these forms as physically observed laws. A mpere’s law 
was mentioned for a special case in Section 7.2, Example 12. 

Physically, we interpret E as the electric field and H as the magnetic field. A ccording 
to the preceding equations, as time t progresses, these fields interact with each other, 
and with any charges and currents that are present. For example, the propagation of 
electromagnetic waves (TV signals, radio waves, light from the sun, etc.) in a vacuum 
is governed by these equations with J =0 and p =0. 

Because V-H = 0, we can apply Theorem 8 (from Section 8.3) to conclude that 
H = V x A for some vector field A. (We are assuming that H is defined on all of R? for 
each time t.) The vector field A is not unique, and we can use A’ = A + V f equally 
well for any function f(t, x, y, z), because V x Vf = 0. (This freedom in the choice 
of A is called gauge freedom.) For any such choice of A, we have, by equation (6), 


Substituting this equationandH = V xA into equation (7), and using the vector identity 
(whose proof we leave as an exercise) 


Vx(VxA)=V(V-A)—V’A, 


we get 
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Thus, 
a7A a 
VA — — = -J + V(V-A)+ —(V@). 
at J+V(V-A)+ mA $) 
Thatis, 
ə? A ag 
VA -—-— =-J+V(V-A+— ). 16 
T+ ( + £) (16) 
Again using the equation E + dA/at = —V¢ and the equation V -E = p, we obtain 
oA 5 ə(V-A) 
p=v-E=v- (-ve =) = Vo T 
Thatis, 
ə(V-A) 
V?$ = -p — ' 17 
= -p T (17) 
Now let us exploit the freedom in our choice of A. We impose the “condition” 
dp 
. — = i 1 
V-A+ at 0 (18) 


We must be sure we can do this. Supposing we have a given Ag and a corresponding ¢o, 
can we choose anew A = Ay + Vf and then anew ¢ Such that V -A + 0¢/at = 0? 
With this new A, the new ¢ is o — of /at; we leave verification as an exercise for the 
reader. Condition (10) on f then becomes 


_ _ (go — af /at) _ >, , 60 Of 
(=V: lA Vi) = Vt Vt 
or 
a? f Ido 
2 — ——— Z — . — 
Tiem: (v Ao + ). (19) 


Thus, to be able to choose A and ¢ satisfying V -A + 3¢/ət = 0, we must be able 
to solve equation (11) for f. We can indeed do this under general conditions, although 
we do not prove it here. Equation (11) is called the inhomogeneous wave equation. 

If we accept that A and ¢ can be chosen to satisfy V - A + 3¢/ət = 0, then equations 
(8) and (9) for A and œ become 


ə?A 

V’A — m~ -J; (8’) 
a2 

Vo — = =-p (9’) 


Conversely, if A and @ satisfy the equations V-A + d¢/at = 0, V?¢ — 07@/at? = 
—p, and V?A — a*A/at? = —J, then E = —V@ — dA/at and H = V x A satisfy 
M axwell’s equations. This procedure then “reduces” M axwell’s equations to a study of 
the wave equation.’ 


7T here are variations on this procedure. For further details, see, for example, Differential Equations of 
Applied Mathematics, by G. F. D. Duff and D. Naylor, Wiley, New York, 1966, or books on electromag- 
netic theory, such as Classical Electrodynamics, by J. D. Jackson, Wiley, New York, 1962. 
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Since the eighteenth century, solutions to the wave equation have been well studied 
(one learns these in most courses on differential equations). To indicate the wavelike 
nature of the solutions, for example, observe that for any function f, 


olt, x,y,z) = f(x —t) 


solves the wave equation V2@ — (d*@/at?) = 0. This solution just propagates the graph 
of f like a wave; thus, we might conjecture that solutions of M axwell’s equations are 
wavelikein nature. Historically, all of this was M axwell’s great achievement, and it soon 
led to Hertz’s discovery of radio waves. 

Mathematics again shows its uncanny ability not only to describe but to predict 


natural phenomena. 


exercises 


In Exercises 1 to 4, verify the divergence theorem for the given region W, boundary dW oriented outward, and vector field F. 


1 


10. 


. W =(0, 1] x [0,1] x [0, 1] 
F = xi+ yj+zk 


» W asin Exercise 1, and F = zyi+ xzj + xyk 


. W = (x,y,z) 2x2 +y24+2z2 <1} (the unit ball), 


F = xi+ yj +zk 
a W asin Exercise 3, and F = —yi+ xj + zk 


. Use the divergence theorem to calculate the flux of 
F = (x — y)i+(y —z)j + (z — x)k out of the unit 
sphere. 


» Let F = x3i + y3j + 22k. Evaluate the surface integral 
of F over the unit sphere. 


. Evaluate ffy F-dS, where F = xi + yj + zk and W is 
the unit cube (in the first octant). Perform the calculation 
directly and check by using the divergence theorem. 


» Repeat Exercise 7 for 
(a) F=i+j+k 
(b) F =x*i+ x2j +27k 
» LetF = yi + zj + xzk. Evaluate [f F -dS for each of 
the following regions W : 
(a) x? +y? <z<1 
(b) x? +y? <z<landx>0 
(c) x2? +y? <z<landx <0 
Repeat Exercise 9 for F = (x —y)i+(y—z)j+ 


(z — x)k. [The solution to part (b) only is in the Study 
Guide to this text. ] 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


Find the flux of the vector field F = (x — y?)i + yj +x°k 
out of the rectangular solid [0, 1] x [1, 2] x [1, 4]. 


Evaluate [f F-dS, where F = 3xy?i + 3x?yj + 29k 
and S is the surface of the unit sphere. 


Let W be the pyramid with top vertex (0, 0, 1), and base 
vertices at (0, 0, 0), (1, 0, 0), (0, 1, 0), and (1, 1, 0). Let 
S be the two-dimensional closed surface bounding W , 
oriented outward from W . Use Gauss’ theorem to 
calculate ffs F- dS, where: 


F(x, y, z) = (x?y, 3y2z, 92x). 


Let W be the three-dimensional solid enclosed by the 
surfaces x = yĉ, x = 9, z = 0, and x = z. Let S be the 
boundary of W . U se Gauss’ theorem to find the flux of 
F(x, y, z) = (3x — 5y)i + (4z — 2y)j + (8yz)k across 
S: ff F -dS. 


Evaluate ffy F-mdA, where F(x, y, z) = 

xi + yj — zk and W is the unit cube in the first octant. 
Perform the calculation directly and check by using the 
divergence theorem. 

Evaluate the surface integral ff; F-ndA, where 

F(x, y, z) =i +j +2(x2 + y*)2k and aS is the surface 
of the cylinder x? + y? < 1,0 <z <1. 


Prove that 


1 (VF) -Féxdyde = |f f F-ndS 
w aw 
- fff f V-Fdxdydz. 
w 


18. 


19. 


20. 


21. 


22. 


23. 


Prove the identity 
V-(F x G)=G-(V x F) -F-(V xG). 


Show that [ffy (1/r2) dxdydz = ffy (r-m/r?) dS, 
where r = xi + yj + zk. 


Fix vectors vı, ..., Vk € IR? and numbers (“charges”) 


qi, ..-, 4k. Define the function ø by (x, y, z) = Ph 
qi /(4z |ir — vi ||), where r = (x, y, z). Show that for a 
closed surface S and E = —V¢ġ¢, 


f|- ss=0 


where Q is the total charge inside S. (Assume that 
Gauss’ law from Theorem 10 applies and that none of 
the charges are on S.) 


Prove Green’s identities 


I tvg-nas= ff (f V2g + Vf - Vg) dV 
oW W 


If (tvg-gvf)-nas = ff f (tv?a-ov?t av. 
aw Ww 


Suppose F satisfies div F = 0 and curl F = 0 on all of 
R3. Show that we can write F = Vf, where V? f = 0. 


Let p be a continuous function on R3 such that 
p(q) = 0 except for q in some region W.Letq e W be 
denoted by q = (x, y, z). The potential of p is defined 
to be the function 

pla) 


aE lp — ail 


where ||p — qli is the distance between p and q. 


V (q), 


(a) Using the method of Theorem 10, show that 


l vens=— [ff pav 


for those regions W that can be partitioned into a 
finite union of symmetric elementary regions. 


(b) Show that ¢ satisfies Poisson’s equation 
V-¢ = —p. 


[HINT: Use part (a).] (Notice that if p is a charge density, 
then the integral defining @ may be thought of as the 
sum of the potential at p caused by point charges 
distributed over W according to the density p.) 


24. 


25. 


26. 
27. 


28. 


29, 


30. 
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Suppose F is tangent to the closed surface S = aW of a 
region W . Prove that 


JII (div F)dV = 0. 
W 


Use Gauss’ law and symmetry to prove that the electric 
field due to a charge Q evenly spread over the surface of 
a sphere is the same outside the surface as the field from 
a point charge Q located at the center of the sphere. 

W hat is the field inside the sphere? 


Reformulate Exercise 25 in terms of gravitational fields. 


Show how Gauss’ law can be used to solve part (b) of 
Exercise 29 in Section 8.3. 


Let S be a closed surface. Use Gauss’ theorem to show 
that if F is aC 2 vector field, then we have 
SLAY x F)-dS =0. 


Let S be the surface of region W . Show that 


[frends = 3votume tw). 
S 


Explain this geometrically. 


For a steady-state charge distribution and 
divergence-free current distribution, the electric and 
magnetic fields E(x, y, z) and H (x, y, z) satisfy 


VxE=0, V-H=0O, 
and VxH=J. 


V-J =0, V-E=p, 


Here p = p(X, y, z) and J (x, y, z) are assumed to be 
known. The radiation that the fields produce through a 
surface S is determined by a radiation flux density 
vector field, called the Poynting vector field, 


P=E xH. 


(a) If S isa closed surface, show that the radiation 
flux— that is, the flux of P through S— is given by 


[fro [Ifo 


where V is the region enclosed by S. 
(b) Examples of such fields are 


E(x, y,z) =2j+yk and 
H(x, y, z) = —xyi+ xj + yzk. 


In this case, find the flux of the Poynting vector 
through the hemispherical shell shown in 
Figure 8.4.9. (Notice that it is an open surface.) 
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figure 8.4.9 The surface for Exercise 30. 


(c) The fields of part (b) produce a Poynting vector figure 8.4.10 The surface for Exercise 30(c). 
field passing through the toroidal surface shown in 
Figure 8.4.10. What is the flux through this torus? 


8.5 Differential Forms 


The theory of differential forms provides an elegant way of formulating Green's, Stokes’, 
and Gauss’ theorems as one statement, the fundamental theorem of calculus. The birth 
of the concept of a differential form is another dramatic example of how mathematics 
speaks to mathematicians and drives its own development. These three theorems are, in 
reality, generalizations of the fundamental theorem of calculus of Newton and Leibniz 
for functions of one variable, 


b 
f f'(x) dx = f(b) — f(a) 


to two and three dimensions. 

Recall that Bernhard Riemann created the concept of n-dimensional spaces. If the 
fundamental theorem of calculus was truly fundamental, then it should generalize to 
arbitrary dimensions. But wait! The cross product, and therefore the curl, does not 
generalize to higher dimensions, as we remarked in footnote 3, in Section 1.3. Thus, 
some new idea is needed. 

Recall that Hamilton searched for almost 15 years for his quaternions, which ulti- 
mately led to the discovery of the cross product. What is the nonexistence of a cross 
product in higher dimensions telling us? If the fundamental theorem of calculus is the 
core concept, this suggests the existence of a mathematical language in which it can be 
formulated in n-dimensions. In order to achieve this, mathematicians realized that they 
were forced to move away from vectors and on to the discovery of dual vectors and an 
entirely new mathematical object, a differential form. In this new language, all of the 
theorems of Green, Stokes, and Gauss have the same elegant and extraordinarily simple 
form. 

Simply and very briefly stated, an expression of the type P dx + Q dy is a 1-form, or 
a differential one-form on a region in the xy plane, and F dx dyisa2-form. Analogously, 
we can define the notion of an n-form. There is an operation d, which takes n-forms to 
n+1-forms. Itis like a generalized curl and has the property that for œ = P dx+Q dy, 
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we have 


and so in this notation, Green’s theorem becomes 


f o= |do 
aD D 


which, interestingly, just switches the boundary operator ð with the d operator. H ow- 
ever, differential forms are more than just notation. They create a beautiful theory that 
generalizes to n-dimensions. 

In general, if M is an oriented surface of dimension n with an (n — 1)-dimensional 
boundary dM and if œ is an (n — 1)-form on M, then the fundamental theorem of 
calculus (also called the generalized Stokes’ theorem) reads 


f o= | do. 
aM M 


A useful thing for you to contemplate at this stage is the sense in which the funda- 
mental theorem of calculus becomes a special instance of this result. 

In this section, we shall give a very elementary exposition of the theory of forms. 
B ecause our primary goal isto show thatthe theorems of G reen, Stokes, and G auss can be 
unified under a single theorem, we shall be satisfied with less than the strongest possible 
version of these theorems. M oreover, we shall introduce forms in a purely axiomatic and 
nonconstructive manner, thereby avoiding the tremendous number of formal algebraic 
preliminaries that are usually required for their construction. To the purist our approach 
will be far from complete, but to the student it may be comprehensible. We hope that 
this will motivate some students to delve further into the theory of differential forms. 

We shall begin by introducing the notion of a 0-form. 


O-Forms 
Let K bean open set in R?. A 0-form on K isa real-valued function f:K — R. When 
we differentiate f once, it is assumed to be of class C t, and C * when we differentiate 
twice. 

Given two 0-forms fı and fz on K , we can add them in the usual way to get a new 
0-form fı + f2 or multiply them to get a 0-form fj f2. 
f(x, y, z) = xy + yz and f(x, y, z) = y sin xz are 0-forms on Rè: 

(fi + f2)(x, y, zZ) = XY + yz + y sin xz 

and 


(fi f2)(x, y, Z) = y?x sin xz + y?zsinxz. a 
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dx 


dz dy 
Kw” 


figure 8.5.1 The 
cyclic order of dx, 
dy, and az. 


1-Forms 


The basic 1-forms are the expressions dx, dy, and dz. At present we consider these to 
be only formal symbols. A 1-form œ on an open set K is a formal linear combination 


œw = P (x, y, z) dx + Q(x, y, z) dy + R(x, y, z) dz, 
or simply 
w = P dx + Q dy + R dz, 
where P, Q, and R are real-valued functions on K . By the expression P dx we mean 
the 1-form P dx + 0 -dy + 0- dz, and similarly for Q dy and R dz. Also, the order of 
P dx, Q dy, and R dz is immaterial, and so 


P dx + Q dy + R dz = R dz + P dx + Q dy, etc. 


Given two 1-forms œ = Pı dx + Qidy + Rı dz and œ = P2 dx + Q2dy + R2 dz, we 
can add them to get a new 1-form w; + w defined by 


œ +a = (P1 + P2) dx+(Q1+Q2) dy + (Ri + R2) dz, 
and given a 0-form f, we can form the 1-form f œ defined by 


fœ = ( fP1) dx + (fQ1) dy + ( fR:) dz. 


Let w = (x + y?) dx + (zy) dy + (e%7) dz and w2 = sin y dx + sin xdy be 1-forms. 
Then 


@ +œ = (X + y? + sin y) dx + (zy + sin x) dy + (e%%) dz. 
If f(x, y, z) =x, then 


fœ = X sin y dx + x sin xdy. A 


2-Forms 
Thebasic 2-formsare the formal expressions dx dy, dy dz, and dz dx. These expressions 
should be thought of as products of dx and dy, dy and dz, and dz and dx. 
A 2-form 7 on K is a formal expression 
n =F dxdy+G dydz + H dzdx, 


where F, G, and H are real-valued functions on K . The order of F dx dy, G dy dz, and 
H dzdx is immaterial; for example, 


F dxdy +G dydz + H dzdx = H dz dx + F dxdy + G dy dz, etc. 


Atthis point itis useful to note that in a 2-form the basic 1-forms dx, dy, and dz always 
appear in cyclic pairs (see Figure 8.5.1), that is, dx dy, dy dz, and dz dx. 
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By analogy with 0-forms and 1-forms, we can add two 2-forms 
ni = Fi dxdy + G; dydz + H; dzdx, 
i = l and 2, to obtain a new 2-form, 
nı +m = (Fit F2) dxdy+(Gi+G2)dydz+(H, + H2)dzdx. 
Similarly, if f is a 0-form and 7 is a 2-form, we can take the product 
fn = ( fF) dxdy + ( fG )dydz + ( fH ) dz dx. 
Finally, by the expression F dx dy we mean the 2-form F dx dy + 0 - dy dz + 0 - dz dx. 


The expressions 
m = x? dx dy + yx dy dz + sin zy dz dx 
and 
m = y dy dz 
are 2-forms. Their sum is 
m +m = x? dx dy + (y?x + y) dy dz + sin zy dz dx. 
If f(x, y, z) = xy, then 
fm = xy?°dydz. A 


3-Forms 


A basic 3-form is a formal expression dxdydz (in this specific cyclic order, as in 
Figure 8.5.1). A 3-form v on an open set K c R? is an expression of the form v = 
f (x, y, z) dxdydz, where f is a real-valued function on K . 

We can add two 3-forms and we can multiply them by 0-forms in the obvious way. 
There seems to be little difference between a 0-form and a 3-form, because both involve 
a single real-valued function. But we distinguish them for a purpose that will become 
clear when we multiply and differentiate forms. 


Let vı = ydxdydz, vy = e dx dy dz, and f(x,y,z) = xyz. Then v + v = 
(y +e’) dxdydzand f vı = y’xzdxdydz. A 


Although we can add two 0-forms, two 1-forms, two 2-forms, or two 3-forms, we do 
not need to add a k-form and a j-form if k Æ j. For example, we shall not need to write 


f(x, y, z) dxdy + (Xx, y, Z) dz. 


Now that we have defined these formal objects (forms), we can legitimately ask what 
they are good for, how they are used, and, perhaps most important, what they mean. The 
answer to the first question will become clear as we proceed, but we can immediately 
describe how to use and interpret them. 

A real-valued function on a domain K in R? is a rule that assigns a real number 
to each point in K . Differential forms are, in some sense, generalizations of the real- 
valued functions we have studied in calculus. In fact, 0-forms on an open set K are just 
functions on K . Thus, a0-form f takes points in K to real numbers. 
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figure 8.5.2 The four geometric types 
of subsets of an open set K c R° to 
which the theory of forms applies. 


We should like to interpret differential k-forms (for k > 1) not as functions on 
points in K , but as functions on geometric objects such as curves and surfaces. M any 
of the early Greek geometers viewed lines and curves as being made up of infinitely 
many points, and planes and surfaces as being made up of infinitely many curves. 
Consequently, there is at least some historical justification for applying this geometric 
hierarchy to the interpretation of differential forms. 

Given an open subset K c R?, we shall distinguish four types of subsets of K (see 
Figure 8.5.2): 


(i) pointsin K, 

(ii) oriented simple curves and oriented simple closed curves, C, in K, 
(iii) oriented surfaces, S c K, 

(iv) elementary subregions, R c K. 


The Integral of 1-Forms Over Curves 
We shall begin with 1-forms. L et 
w = P (x, y, z) dx + Q(x, y, z) dy + R(x, y, z) dz 


be a 1-form on K and let C be an oriented simple curve as in Figure 8.5.2. The real 
number that w assigns to C is given by the formula 


for f Pya dx + Q(x, y, Z) dy + R(x, y, Z) dz. (1) 
C G 


Recall (see Section 7.2) that this integral is evaluated as follows. Suppose that 
c: [a,b] > K, celt) = (x(t), y(t), z(t)) is an orientation-preserving parametrization 
of C. Then 


[ox [o= f [pote yoamn: R 


+ QOC, yt), 2(t)) -Z +ROD, yee), zte) -E| at 


Theorem 1 of Section 7.2 guarantees that f- w does not depend on the choice of the 
parametrization c. 

We can thus interpret a 1-form œ on K as a rule assigning a real number to each 
oriented curve C c K;a2-form 7 will similarly be seen to be a rule assigning a real 
number to each oriented surface S c K ; and a3-form v will be a rule assigning a real 
number to each elementary subregion of K . The rules for associating real numbers with 
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curves, surfaces, and regions are completely contained in the formal expressions we 
have defined. 


Let œ = xy dx + y*dy + dz bea1-form on R? and let C be the oriented simple curve 
in IR? described by the parametrization e(t) = (t?, t3, 1),0 <t < 1. C is oriented by 
choosing the positive direction of C to be the direction in which e(t) traverses C as t 
goes from 0 to 1. Then, by formula (1), 


1 1 
fo=] (t3(2t) + t9(3t2) + 0] t= f (t6 + 3t8) dt = B. 
C 0 0 21 


Thus, this 1-form w assigns to each oriented simple curve and each oriented simple 
closed curve C in R? the number fe w. A 


The Integral of 2-Forms Over Surfaces 


A 2-form ņ on an open set K c R? can similarly be interpreted as a function that 
associates a real number with each oriented surface S c K. This is accomplished by 
means of the notion of integration of 2-forms over surfaces. L et 


n =F (x,y,z) dxdy+G(x, y,z)dydz+H (x, y, z) dz dx 


be a 2-form on K , and let S c K bean oriented surface parametrized by a function 
: D > R?, D c R?, ®(u, v) = (x(u, v), y(u, v), z(u, v)) (See Section 7.3). 


Definition If S is such a surface and n is a 2-form on K , we define ff; n by the 
formula 


eg) F dxdy +G dydz + H dz dx 
5 


= a X(u, v), y(U, v), Z v)) aa 
(2) 
+ G (x(u, v), ytu, v), z(u, o)) SA 
+ H (x(u, v), yu, v), z(u, v). SE A 
where 
əx əx əy əy əz əz 


a(x, y) |əu əv d(y,Z) |ðu ðv alz, x) _|ðu ðv 
alu, v) əy ay d(u, v) əz az alu, v) fəx ax] 
ðu ðv ðu ðv ðu ðv 
If S is composed of several pieces S;,i = 1,..., k, as in Figure 8.4.4, each with 
its own parametrization ®;, we define 


Ip-X ff 


We must verify that ff n does not depend on the choice of parametrization ®. 
This result is essentially (but not obviously) contained in Theorem 4, Section 7.6. 
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example 6 


solution 


example 7 


solution 


Let n = z? dx dy bea 2-form on R?, and let S be the upper unit hemisphere in R?. Find 


Ssn. 


Let us parametrize S by 
(u, v) = (sinu cos», sinu sin v, cosu), 


where (u, v) e D = [0, x/2] x [0, 27]. By formula (2), 


H=) cos? u ray | dude 


where 
a(x, y) cosu cosv —sinusinu 
d(u, v) cosusinv sinucosv 
= sin u cosu cos? v + cosu sinu sin? v = sin u cosu. 
Therefore, 


[fn [fcostucosu sinu dude 
S D 
2x n/2 27 cos? u x/2 x 
= f costusinududy = f |- | dv=>. A 
0 0 0 4 0 2 


Evaluate ff; x dydz+ydxdy, where S is the oriented surface described by the 
parametrization x = u + v, y =u? — v2, Z = uv, where (u, v) e D = [0, 1] x [0, 1]. 


By definition, we have 


= ee eee 
au,v) |v =r 
Axy) Jo Ij 
alu, v) |2u —2v| are 


Consequently, 


[ff xaydz ty andy = ff tu + oi 2yu? +02) +(u? — au +v)ldudv 
S D 


lL pl 
=4 [fe +urydudo=4 | fv? + uv) dude 
D o Jo 
1 2,,271 1 2 
=4/ fur 4 dv=4 | (P+F) 
0 2 0 0 2 
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The Integral of 3-Forms Over Regions 


Finally, we must interpret 3-forms as functions on the elementary subregions of K . Let 
v = f(x, y, z) dxdydz be a 3-form and let R c K bean elementary subregion of K . 
Then to each such R c K we assign the number 


J= f(x, y, z) dxdydz, (3) 
R R 
which is just the ordinary triple integral of f over R, as described in Section 5.5. 


Suppose v = (x + z) dx dy dz and R = [0, 1] x [0, 1] x [0, 1]. Evaluate fff} v. 


We compute: 


[ffo= [ [fora dxdyde= | f [a dxdydz 
[f +n] ya f f (3+2) ydz= f (5 +2) a2 


zZz z2]! 
adie a A 
E+, 


The Algebra of Forms 


We now discuss the algebra (or rules of multiplication) of forms that, together with 
differentiation of forms, will enable us to state Green's, Stokes’, and Gauss’ theorems 
in terms of differential forms. 

If wis ak-form and 7 is an l-form on kK ,0 < k+! <3, there is a product called the 
wedge product w ^ 7 of œw and 7 thatisak +1 form on K . The wedge product satisfies 
the following laws: 


(i) For each k there is a zero k-form 0 with the property that 0 + œ = œ for all k- 
forms w and 0 A n = 0 for all l-forms n if 0 < k +1 <3. 


(ii) (Distributivity) If f is a0-form, then 
(fai + a2) An = f(@, An) + (a2 An). 


(iii) (Anticommutativity) œ An = (—1)"(n Ao). 


(iv) (Associativity) If a, w2, w3 are ky, k2, k3 forms, respectively, with k; +k2+k3 < 3, 
then 


w A (œ A @3) = (@1 A w) A @3. 
(v) (Homogeneity with respect to functions) If f is a0-form, then 
woA(fn) =(fo)An= flæsn). 


Notice that rules (ii) and (iii) actually imply rule (v). 
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(vi) The following multiplication rules for 1-forms hold: 


dx A dy = dxdy 

dy A dx = —dxdy = (—1)( dx ^ dy) 

dy A dz =dydz = (—1)(dz ^ dy) 

dz A dx = dz dx = (—1)(dx a dz) 

dx A dx = 0, dy Ady = 0, dz a dz=0 

dx A (dy A dz) = (dx Ady) A dz = dxdy dz. 


(vii) If f isa0-form and w is any k-form, then f Aw = fo. 


Using laws (i) to (vii), we can now find a unique product of any |-form 7 and any k-form 
œw if0<k+1 <3. 


example 9 | Show that dx ^ dydz = dxdy dz. 


solution By rule (vi), dydz = dy ^ dz. Therefore, 
dx Adydz = dx ^ (dy ^ dz) = dxdydz. A 


example 10 | If œ= x dx + ydy and ņ = zy dx + xz dy + xy dz, find œ ^ n. 


solution Computing w A n, to get 


œw ^A n = (X dx + y dy) A^ (zy dx + xz dy + xy dz) 
= [(x dx + y dy) A (zy dx)] + [(x dx + y dy) A (xz dy)] 
+ [(x dx + y dy) A (xy dz)] 
= xyz(dx A dx) + zy2(dy A dx) + x*z(dx A dy) + xyz(dy ^ dy) 
+x?y(dx A dz) + xy?(dy A dz) 
= —zy* dx dy + x2z dx dy — x?y dz dx + xy*dydz 
= (x*z — y?z) dxdy — x?y dz dx + xy?dydz. A 


example 11 If œ = x dx — y dy and ņ = x dy dz + z dxdy, find w A ņ. 


solution œw A n = (X dx — y dy) ^ (x dy dz + z dx dy) 

= [(x dx — y dy) A (x dy dz)] + [(x dx — y dy) ^ (z dx dy)] 

= (x? dx A dydz) — (xy dy A dy dz) + (xz dx A dx dy) 
—(yzdy A dxdy) 

= [x?dx a (dy A dz)] — [xy dy A (dy A dz)] + [xzdx a (dx a dy)] 
—[yzdy ^ (dx  dy)] 

= x? dxdydz — [xy(dy A dy) A dz] + [xz(dx A dx) A dy] 
— [yz(dy ^ dx) A dy] 

= x? dxdydz—xy(0A dz) + xz(0 A dy) + [yz(dy Ady) A dx] 

= x? dxdydz. A 


The last major step in the development of this theory is to show how to differentiate 
forms. The derivative of a k-form is a (k + 1)-form if k < 3, and the derivative of a 


example 12 


solution 


example 13 


solution 


8.5 Differe 


3-form is always zero. If w is a k-form, we shall denote the derivative 
operation d has the following properties: 


(1) If f:K — Risa 0-form, then 


of of of 
df= r dx + ay 


(2) (Linearity) If œ and œ are k-forms, then 
d(@1 + @2) = da; + da. 
(3) If wis ak-form and 7 is an|-form, 


dlwa n) = (dw an) +(—1)*(@ A dy). 


(4) d(dw) = 0 and d(dx) = d(dy) = d(dz) = 0 or, simply, d? = 0. 
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of w by dw. The 


Properties (1) to (4) provide enough information to allow us to uniquely differentiate 


any form. 


Letw = P(x, y,z) dx+ Q (x, y, z) dy bea 1-form on some open set K 


d[P (x, y, z) dx + Q(x, y, z) dy] 
= d[P(x, y,z) A dx] + d[Q(x, y, z) A dy] 
= (dP a dx) +[P A d(dx)] + (dQ Ady) +[Q a d( 
= (dP a dx) + (dQ Ady) 


aP oP aP 
= (= dx + Thi ei az) A dx 


aQ aQ aQ 
+ (3 dx + ay = dz) A dy 


aP aP aP 
= Ga ax) + (Fay ax) + (Foza ax) 


+ (Ranay) + (Royagy) + (Faz agy 


Ox OZ 
oP oP aQ dQ 
= ao zz 02 ox + xY- z9 
aQ  əP oP aQ 
= (3 - z) dx dy + zZ- gz a2. a 


c R?.Finddo. 


(using 2) 
dy)] (using 3) 
(using 4) 


(using 1) 


) 


Let f be a 0-form. Using only differentiation rules (1) to (3) and the fact that d (dx) = 


d(dy) = d(dz) = 0, show that d(df) =0. 
By rule (1), 


and so 


af af af 
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Working only with the first term, using rule (3), we get 


af af af af 
d (Fox) = TE na) =d (2) ade Z aaa 
- (5 3f af 


aK? X+ ayax’) + ET dz) A dx+0 


a’ f ə? f 
= TA OX + ay N dx 


a2 f 


= aa dxdy + 


oZ OX 


Similarly, we find that 


af a? f ə? f 
d (žo) = ay dx dy emt 


and 


of ə? f ak 


Adding these up, we get d (d f ) = 0 by the equality of mixed partial derivatives. 


example 14 | Show that d( dxdy), d(dy dz), and d( dz dx) are all zero. 
solution To prove the first case, we use property (3): 
d(dx dy) = d(dx A dy) = [d(dx) A dy — dx A d(dy)] = 0. 
The other cases are similar. A 
example 15 | If n =F (x,y,z) dxdy +G (x, y, z)dydz + H (x, y, z) dz dx, find dy. 
solution By property (2), 
dn = d(F dxdy) + d(G dydz) +d(H dz dx). 
We shall compute d (F dx dy). Using property (3) again, we get 
d(F dxdy) =d(F A dxdy) =dF a (dxdy) + F ^ d(dx dy). 


By Example 14, d(dx dy) = 0, so we are left with 


dF A (dx dy) = (Fo w+ dz) A (dx A dy) 
oF 
“Ba malls [Fay a (ax ay] 


+ [Zez A (dX ^ ay) . 
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Now 
dx A (dx A dy) = (dx A dx) Ady = 0 Ady = 0, 
dy A (dx A dy) = —dy ^ (dy A dx) 
= —(dy Ady) a dx =0adx=0, 
and 
dz a (dx A dy) = (—1)*(dx A dy) A dz = dxdydz. 
Consequently, 


oF 
d(F dxdy) = zz Wx dy dz. 
Analogously, we find that 
d(G dydz) = °° axdydz and d(H dz dx) = 7 dxdydz. 


Therefore, 


oF dG OH 


We have now developed all the concepts needed to reformulate Green's, Stokes’, and 
Gauss’ theorems in the language of forms. 


Theorem 11 Green's Theorem Let D be an elementary region in the xy 
plane, with aD given the counterclockwiseorientation. Suppose w = P(x, y) dx+ 
Q(x, y) dy is a 1-form on some open set K in R? that contains D . Then 


| = do. 
3D D 


Here dw isa2-form on K and D isin fact a surface in R? parametrized by ®: D —> 
R?, (x, y) = (x, y, 0). Because P and Q are explicitly not functions of z, then aP /az 
and 0Q /ðəz = 0, and by Example 12, dw = (dQ /ax — aP /ay) dx dy. Consequently, 
Theorem 13 means nothing more than that 


7 aQ aP 
a ff (5 ay) oo 


which is precisely Green’s theorem. Hence, Theorem 13 holds. Likewise, we have the 
following theorems. 


Theorem 12 Stokes' Theorem Let S be an oriented surface in R? with a 
boundary consisting of a simple closed curve oS (Figure 8.5.3) oriented as the 
boundary of S (see Figure 8.2.1). Suppose that w is a 1-form on some open set K 


that contains S. Then 
/ o= | do. 
as 5 
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figure 8.5.3 An oriented surface to which Stokes’ theorem 
as applies. 


y 


Theorem 13 Gauss’ Theorem LetW c R? bean elementary region with 
aW given the outward orientation (see Section 8.4). If n isa 2-form on some region 


ow Ww 


The reader has probably noticed the strong similarity in the statements of these theo- 
rems. In the vector-field formulations, we used divergence for regions in R? (Gauss’ the- 
orem) and the curl for surfaces in IR? (Stokes’ theorem) and regions in IR? (Green's 
theorem). H ere we just use the unified notion of the derivative of a differential form for 
all three theorems; and, in fact, we can state all theorems as one by introducing a little 
more terminology. 

By an oriented 2-manifold with boundary in R? we mean a surface in R? whose 
boundary is a simple closed curve with orientation as described in Section 8.2. By an 
oriented 3- manifold in R? we mean an elementary region in R? (we assume its boundary, 
which is a surface, is given the outward orientation discussed in Section 8.4). We call 
the following unified theorem “Stokes’ theorem,” according to the current convention. 


Theorem 14 General Stokes’ Theorem LetM bean oriented k-manifold 
in IR? (k = 2 or 3) contained in some open set K . Suppose w is (k — 1)-form on 


K . Then 
i o= | do. 
aM M 


Here the integral is interpreted as a single, double, or triple integral, as is appropri- 
ate. In fact, it is this form of Stokes’ theorem that generalizes to spaces of arbitrary 
dimension. 


exercises 
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1. Evaluate w ^ ņ if 


(a) w=2xdx+ydy 
n =x? dx + y?dy 


(b) œ=x dx -— ydy 
n=ydx+x dy 


(c) o=xdx+ydy+zdz 
n=zdxdy+xdydz+ydz dx 


(d) w=xy dydz + x2 dx dy 
n= dx + dz 


(e) œ= e% dxdy 
n=e “V2 dz 


2. Prove that 


(a, dx + a2 dy +a3dz) A (by dydz + bz dz dx + b3 dx dy) 


3 
= (dae) vaya 
i=l 


3. Find dw in the following examples: 


(a) o =x?y +y? 
(b) œ = y? cosx dy + xy dx + dz 
(c) œ = xy dy + (x + y)? dx 
(d) œw = x dx dy + z dydz + y dz dx 
(e) œ = (x? + y?) dydz 
(f) œ = (x? + y? +z?) dz 

—X 
ON Fy? 
(h) œw =x2ydydz 


dx + dy 


y 
x? + y? 


. LetC be the line segment from the point (—2, 0, 1) to 
(3, 6,9). Let w1 = y dx + x dy + xy dz, 
w2 = Z dX + y dy + 2x dz, and f(x, y, z) = xy. 


Calculate the following: 
9 farta 
c 


a f fai w f ie 
C Cc 


. LetC be parameterized by c(t) = (t? + 4t, t + 1), 
t e (0, z]. Let w1 = y dx + x dy, w2 = y2dx + xdy, 
and f(x, y) = x. Calculate the following: 


a f for o f f %2 O [orton 
c C c 


6. 


10. 


11. 


12. 


13. 


Let V: K — R? bea vector field defined by 
V(x, y,Z) =G(x, y,z)i+ H (x, y, z)j + F (x, y, z)k, 
and let n be the 2-form on K given by 

n =F dxdy+G dydz + H dz dx. 


Show that dn = (div V) dxdydz. 


. If V = A(x, y, z)i+ B(x, y,z)j +C(X, y, Zkisa 


vector field on K c R?, define the operation Formz: 
Vector Fields —> 2-forms by 


Form2(V) = A dydz + B dz dx + C dxdy. 


(a) Show that 
Formz (V1 + V2) = æ Formz (Vi) + Form2(V2), 
where a is a real number. 


(b) Show that Form2(curl V) = dw, where 
œ = A dx + B dy +C dz. 


» Using the differential form version of Stokes’ theorem, 


prove the vector field version in Section 8.2. R epeat for 
Gauss’ theorem. 


. Interpret Theorem 16 in the case k = 1. 


Letw = (x + y)dz + (y +z) dx+(x +z) dy, and let S 
be the upper part of the unit sphere; that is, S is the set of 
(x, y, z) with x? + y? +z? = 1 and z > 0. aS is the unit 
circle in the xy plane. Evaluate f, œ both directly and 
by Stokes’ theorem. 


Let T be the triangular solid bounded by the xy plane, 
the xz plane, the yz plane, and the plane 
2x + 3y + 6z = 12. Compute 


I Fy dxdy + Fo dydz+F3dz dx 
ar 


directly and by Gauss’ theorem, if 
(a) Fy =3y, Fz = 18z, F3 = —12; and 
(b) Fi =z, Fo =x’, Fg =y. 


Evaluate ff; œ, where œ = z dx dy + x dy dz + y dz dx 
and S is the unit sphere, directly and by Gauss’ theorem. 


Let R be an elementary region in R3. Show that the 
volume of R is given by the formula 


TEESI x dydz + ydz dx + z dxdy. 
3 J Jar 
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14. In Section 4.2, we saw that the length |(c) of a curve 


c(t) = (x(t), y(t), z(t)), a < t < b, was given by the 


formula 
> ds 
Nd) = fas= f — ]} dt 
es, 


where, loosely speaking, 
(ds)? = (dx)? + (dy)? + (dz)2, thatis, 


dx\? sdy\2 dz\2 
VS) H H 
Now suppose a surface S is given in parametrized form 
by &(u, v) = (x(u, v), y(u, v), z(u, v)), where 


ds 
dt 


review exercises for chapter 8 


(u, v) e D. Show that the area of S can be expressed as 


usr [fos 
D 


where formally (dS)? = (dx A dy)? + (dy A dz)? + 
(dz A dx)?, a formula requiring interpretation. [HINT: 


and similarly for dy and dz. Use the law of forms for the 
basic 1-forms du and dv. Then dS turns out to bea 
function times the basic 2-form du dv, which we can 
integrate over D.] 


1. 


Let F = 2yzi + (—x + 3y + 2)j + (x? + z)k. Evaluate 
Jf; (V x F) - dS, where S is the cylinder x? + y? = a?, 
0 < z < 1 (without the top and bottom). W hat if the top 
and bottom are included? 


. Let W bea region in R3 with boundary aW . Prove the 


identity 


I iF xv xG)}-ds= ff (Vx F)-(V x G)dV 
aw W 
- [[[Fwxvxeyav, 

W 


. Let F = x*yi+ z8j — 2xyzk. Evaluate the integral of F 


over the surface of the unit cube. 


. Verify Green's theorem for the line integral 


[x dx + y dy, 
c 


when C is the boundary of the region between the curves 
y=xandy =x3, 0<x<1. 


. (a) Show that F = (x3 — 2xy3)i — 3x2y4j is a gradient 


vector field. 


(b) Evaluate the integral of F along the path 
x =cos? 0, y = sin? 0, 0 <6 < 7/2. 


» Can you derive Green's theorem in the plane from 


Gauss’ theorem? 


» (a) Show that 


F = 6xy(cosz)i + 3x?(cosz)j — 3x2y(sinz)k is 
conservative (see Section 8.3). 


10. 


11. 


12. 


13. 


(b) Find f such that F = Vf. 


(c) Evaluate the integral of F along the curve 
x =cos?6, y = sin? 0, z = 0,0 < 0 < 2/2. 


» Letr(x, y, z) = (Xx, y, 2), r = |irij. Show that 


V2(logr) = 1/r? and V?(r") = n(n + 1)r"~?. 


. Letthe velocity of a fluid be described by 


F = 6xzi+ x?yj + yzk. Compute the rate at which 
fluid is leaving the unit cube. 


LetF = x*i+ (x?y — 2xy)j — x2zk. Does there exist a 
G such that F = V x G? 


Leta bea constant vector and F = a x r [as usual, 
r(x, y,Z) = (x, y, z)]. Is F conservative? If so, find a 
potential for it. 


Show that the fields F in (a) and (b) are conservative and 
find a function f such that F = Vf. 

(a) F = (y2 )i + (2y e)j 

(b) F = (sin y)i + (x cos y)i + (e7)k 

(a) Let f(x, y, z) = 3xye2. Compute V f. 

(b) Letelt) = (3cos3t, sin? t, €), 0 <t < r. 


Evaluate 
fxi -ds. 
c 


(c) Verify directly Stokes’ theorem for gradient vector 
fields F = Vf. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


Using Green’s theorem, or otherwise, evaluate 
Jc. x3dy — y? dx, where C is the unit circle 
(x2 + y? =). 


Evaluate the integral ffs F- dS, where 
F = xi + yj + 3k and where S is the surface of the unit 
sphere x? + y? +z? =1. 


(a) State Stokes’ theorem for surfaces in R?. 
(b) Let F bea vector field on R? satisfying V x F = 0. 


Use Stokes’ theorem to show that fe F -ds=0 
where C is aclosed curve. 


Use Green's theorem to find the area of the loop of the 
curve x = a sin 8 cos 0, y = a sin? o, fora > 0 and 
0<O0<nz. 


Evaluate fe yz dx+xzdy+xydz, whereC is the curve 
of intersection of the cylinder x? + y? = 1 and the 
surface z = y?. 


Evaluate fe (x + y) dx + (2x — z) dy + (y + z) dz, 
where C is the perimeter of the triangle connecting 
(2, 0, 0), (0, 3, 0), and (0, 0, 6), in that order. 


Which of the following are conservative fields on R3? 
For those that are, find afunction f such that F = Vf. 


(a) F(x, y, z) = 3x?yi + x3j + 5k 
(b) F(x, y,z) =(x+z)i-(y+z)j +(x — y)k 
(c) F(x, y, z) = 2xy3i + x223j + 3x2yz2k 


Consider the following two vector fields in R3: 


(i) F(x, y, z) = y?i — z?j + x’k 
(ii) G(x, y, z) = (x3 — 3xy?)i + ( y3? — 3x2y)j + zk 


(a) Which of these fields (if any) are conservative on 
R3? (That is, which are gradient fields?) Give 
reasons for your answer. 

Find potential for the fields that are conservative. 
Let æ be the path that goes from (0, 0, 0) to (1, 1, 1) 
by following edges of the cube 0 < x <1, 

0<y <1,0 <z <1from(0, 0, 0) to (0, 0, 1) to 
(0, 1, 1) to (1, 1, 1). Let £ be the path from (0, 0, 0) 
to (1, 1, 1) directly along the diagonal of the cube. 
Find the values of the line integrals 


(b 
(c 


fF foss fF foss 
a a B B 


22. 


23. 


24, 
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Consider the constant vector field 
F(x, y,z) =i+2j —kinR3. 


(a) Find a scalar field ¢(x, y, z) in R? such that 
Vo =F in R? and ¢(0, 0, 0) = 0. 

(b) On the sphere © of radius 2 about the origin, find all 
the points at which 
(i) isa maximum, and 
(ii) @ isaminimum. 

(c) Compute the maximum and minimum values of @ 
on &. 


Let F be aC?! vector field and suppose 

V -F(xXo, Yo, Zo) > 0. Show that for a sufficiently small 
sphere S centered at (Xo, Yo, Zo), the flux of F out of S is 
positive. 


Let B c R? bea planar region, and let O € R? bea 
point. If we connect all points in B to O, we get a cone, 
say C, with vertex O and base B. Show that 


Volume (C ) = ; area (B) h, 


where h is the distance of O from the plane of B, using 
the following steps. 


figure 8.R.1 


1. LetO bethe origin of the coordinate system. D efine 
r(x, y, Z): = (x, y, z). Evaluate the flux of r through 
the boundary of C , that is, [fyc r-ndA, where nis 
the outward unit normal to aC. 


2. Evaluate the total divergence [/[. V- rdv. 


3. Use Gauss’ theorem, which states that the total 
divergence of a vector field within a region enclosed 
by a surface is equal to the flux of that vector field 
through the surface: 


[[[ 9-10 - ff. r-ndA. 
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Answers To Odd-Numbered Exercises 


Some solutions requiring proofs may be incomplete or be omitted. 


Chapter 1 IL x =0,z=0,yeRx=0,y =0,zER; 

m y=0,x,zeR;x=0,y,zeER 

Section 1.1 
13. {(2s, 7s +2t,7t)|seR,teR} 

L 4;17 

15. I(t) = —i+ (t -1l)j-—k 

3. (—104+ 16a, —24 — 4b, —22 + 26c) 
I7. I(t) =(2t — 1)i— j + (3t — 1)k 

5. y 
19. {si+ 3sk—2tj)}0<s<10<t<1} 
2L Yes. 
ZB. 0 

i 25. If (x, y, z) lies on the line, then x = 2 +t, y = —2 + t, 


and z = —1 +t. Therefore, 

2x —3y+z2-2=4+42t+6-3t—14+t-2=7, 
which is not zero. Hence, no (x, y, z) satisfies both 
conditions. 


N 


a Yes, 


B 


The set of vectors of the form 

v= pa+qb+rc¢ 
where0<p<1,0<q<1,and0<r<1. 
3L All points of the form 
(Xo +t(X1 — Xo) +$(X2 — Xo), Yo + t( yı — Yo) 
+8(Y2 — Yo), zo + t(Z1 — Zo) + $(22 — Zo)) 


for real numbers t and s. 


33. If one vertex is placed at the origin and the two adjacent 
sides are uand v, the new triangle has sides bu, bv, and 


9, y b(u— v). 
3 TTA 35. (1,0, 1) + (0, 2, 1) = (0, 2, 0) + (1, 0, 2) 
: % v-w 37. Two such lines (there are many others) are 
1 - x=ly=t,z=tandx=1,y=t,z=-t. 
x 
TEn Section 1.2 
L 6 
3. 99° 


493 


494 


3L 


. (a) F = (3V2i+ 3/2)) 
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No, itis 75.7; it would be zero only if the vectors were 
parallel. 


lul = v5, IVi = V2, u: v= -3 
lull = v11, Iivi = v62, u- v= —14 
lull = v14, |ivi| = v26, u- v = —17 
b = 5/4, ¢ = —5/2 


They point in opposite directions. 


. In Exercise 9, cos~! (—14/./11,/62); in Exercise 10, 


x /2; and in Exercise 11, cos} (—17//14/26). 
X=3,7 

—4(—i +j + k)/3 

1/2 


Any (x, y, z) with x + y +z = 0; 
for example, (1, —1, 0) and (0, 1, —1). 


. Draw aright triangle. Label the two legs vand w, so that 


the hypotenuse is vw. By hypothesis, we have 
IVI? + w? = |v- wa}? = Ivi]? — 2v- w+ |W. 
This implies v- w= 0, so wand ware orthogonal. 


i+ 4j, 0 ~ 0.24 radian east of north 


(a) 12:03PM. (b) 4.95 km 
F =50lb 
Fy= 50 sin (50°) Ib 
{50° 
x 
Fx= 50 cos (50°) Ib 
(4.9, 4.9, 4.9) and (—4.9, 4.9, 4.9) N. 


(b) ~0.322 radian or 
18.4° (c) 18/2 

Draw arectangle. L abel two of the nonparallel sides v 
and w so the two diagonals are v + wand v— w Then 
these diagonals are perpendicular if and only if 

0 = (v+ w) - (v— w = |v? — ||, which holds if 
and only if ||vi| = ||. 


Section 1.3 
T 2. 1 3 0 1 
L3 0 1;/=-8 |1 2 1|=8 
2 0 2 2 0 2 
3. —3i+j+ 5k 
5 4/35 
7. 10 
9. +k 
IL +(113i+ 17j — 103k) /./23, 667 
B. u4+ v= 3i- 3j + 3k u. v= 6; |u| = V6; 


15. 


17. 


3L 


35. 


Ivi = 3; ux v= —3i+ 3k 


(a) x+y+z—-1=0 
(b) x +2y+3z-6=0 
(c) 5x +2z = 25 

(d) x +2y -3z = 13 


Show (0, —2, —1) — (1, 4, 0) is parallel to 
(1, 4, 0) — (2, 10, 1), so that the three points lie on a line. 


(a) The parallel planes Ax + By +Cz+D = 0 and 
oAX + oBy +o0Cz+D’ = 0 are identical when 
D’ = oD and otherwise never intersect. 

(b) Ina line. 

The line x =t, y = 2t, z = —5t. 


(a) Do the first by working out each side in coordinates, 
and then use that and a x (bx d = -(bx ġ xa 
to get the second. 

(b) Use the identities in part (a) to write the quantity in 
terms of inner products. 


(c) Use the identities in part (a) and collect terms. 


Compute the results of Cramer’s rule and check that they 
satisfy the equation. 


7 vx W= (i+ fj) x (2j —k) = 2(ix j) -ix k 


+ 2(j x j) -j x k= (-1,1, 2) 
xX —2y+324+12=0 

4x —6y — 10z = 14 

10x — 2y —2z = 8 


10x — 17y +z +25= 0 
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37. For Exercise 25, note that (2, —3, 1) - (1, 1, 1) = 0, and (b) Rectangular 
so the line and plane are parallel and (2, —2, —1) does 


not lie in the plane. For Exercise 26, the line and plane = M z 
are parallel and (1, —1, 2) does lie in the plane. 2 1 =2 
0 3 4 
39. /2/13 
V2/ No 1 1 
4L (a) Show that M satisfies the geometric properties of -2/3 —2 3 
RxF. (b) 2/3 
Spherical 
43. Show that nı(N x a) and n2(N x b) have the same 9 $ 
magnitude and direction. p 
3 arctan } m — arccos (2/3) 
45. One method is to write out all terms in the left-hand side 5 x/2 arccos (4/5) 
and see that the terms involving A all cancel. A nother Vi 
method is to first observe that the determinant is linear 2 arctan (v 2/2) x/3 
in each row or column and that if any row or column is 5 77/6 arccos 2 
repeated, the answer is zero. Then 
a b1 & Cylindrical 
a2 + Aa b2 + Abi C2 + àC1 r 0 z 
a3 b3 3 J arctan 4 -2 
a. bi c ar bi ci] far b c 3 m/2 4 
= ja2 b2 c2}/+Ala1 bi ci]=Ja2 b2 C2}. 
a3 b3 C3 a3 b3 c3 a3 b3 c3 v3 arctan ( v/2/2) 1 
4 77/6 3 
Section 1.4 5. (a) Rotation by z around the z axis. 
L (0,0, ¢) = (4, 52/3, 32/4) (b) Reflection across the xy plane with a radial 
expansion by a factor of 2. 
3. (a) (c) Rotation by x/2 about the z axis together. 
Cylindrical Rectangular 7. (a 
r 0 z x y z a z 
1 
2 z=x +y? 
0 
3 
1 y 
2 
X 
Spherical (b) 
E 
p 0 ¢ 
V2 45° 45° 
2/5 x /2 x — arccos (2./5/5) 
10 45° 0 
5 1/6 arccos £ 
1 x/6 m/2 w. 
2/2 32/4 32/4 x 
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9. No; (p, 6, @) and (—p, 6 + x, x — $) represent the 
same point. 


IL r? +z? =R? 
IB. (a) e = (xi + yj +2k)/\/x?2 + y? +z? 
= (xi + yj +zk)/p 
@ = (—yi + xj)/ yx? + y? = (—yi + xj)/r 
© = (xzit yzj — (x? + y?)k) /rp 


@ xj=-yk/ yx? + y2,@ xj 
= (xz/rp)k+ (r/p)i 


(b 


15. (a) The length of xi+ yj +zkis (x? + y2+22)!/2 = p. 


(b) coso =z/(x2 + y? + z2) 1/2 
(c) cose = x/(x2 + y?) 1/2 


17. 0 <r <a,0 <8 < 2m means that (r, 9, z) is inside the 
cylinder with radius a centered on the z axis, and |z| < b 


means that it is no more than a distance b from the xy 
plane. 


19. —d/(6cos¢) < p <d/2,0<6 < 2x, and 
m —cos-($)<@ <n. 


2L This is a surface whose cross section with each surface 
z =C is a four-petaled rose. The petals shrink to zero as 


Ic] changes from 0 to 1. 
Section 1.5 
L7 


3. |x- y| = 10 = V5V20 = |X Ilyil, so |x- y| < IIXIIIIyII is 


true. 


T17. 


Ix+ yll = 35 = IX] + Iyi}, so x+ yll < IXI] + IYI 
is true. 


Ix- yi = 5 < 65 = [XI Iyso Ix- y1 < IIX [Iyl] is true. 
[x+ yl = v28 < v5 + JTS = |x] + IYI], S0 
x+ yl] < Ixl] + [yl is true. 


. We assume ||vi] = || wi. Then |||? = IIw1?, so 


(v+ w - (V— w =v: v- w- w= |v? — |w]? = 0. 


-1 -1 3 

AB = Ẹ 11 ] , det A = —5, det B = —24, 
-6 5 8 

det AB = 120(=det A det B), det(A + B) = 

—61(4 det A + det B) 


B 


HINT: For k = 2 use the triangle inequality to show that 
|X. + X2 || < Xx] + |||]; then for k = i + 1 note that 
1 + XQ ++ + M411 S 1 FQ +--+ |] I +l. 


(a) Check n = 1 and n = 2 directly. Then reduce an 
n x n determinant to a sum of (n — 1) x (n — 1) 
determinants and use induction. 


The argument is similar to that for part (a). Suppose 
the first row is multiplied by à. The first term of the 
sum will be Aai11 times an (n — 1) x (n — 1) 
determinant with no factors of A. The other terms 
obtained (by expanding across the first row) are 
similar. 


(b 


0 1 


0 0 


Not necessarily. Try A = | 0 0 


Jess- 8) 


(a) The sum of two continuous functions and a scalar 


multiple of a continuous function are continuous. 


(b) ! 
(i) (af +8g)-h - | (af + Bg)(x)h(x) dx 
0 


1 1 
-f tono dep f g(x)h(x) dx 
0 0 


= af -h + 8g -h 


1 
(i) ra= f f(x)g(x) dx 
0 


1 
>| g(x) f(x) dx =g- f 
0 


In conditions (iii) and (iv), the integrand is a perfect 
square. Therefore, the integral is nonnegative and 
can be 0 only if the integrand is 0 everywhere. If 
f(x) 4 0 for some x, then it would be positive in a 
neighborhood of x by continuity, and the integral 
would be positive. 


21. Compute the matrix product in both orders and check 
that you get the identity. 


23. (det A)(det A71) = det (A A71) = det (1) = 1 


Review Exercises for Chapter 1 


L v+ w= 4i + 3j + 6k 3v= 9i + 12j + 15k; 
6v + 8w= 26i + 16j + 38k; —2v = —6i — 8j — 10k; 
v- w= 4;vx w= 9i + 2j — 7k 
Your sketch should display v, w, 3v, 6v, 8w, 
6v-+ 8w, v- was the projection of valong wand vx w 
as a vector perpendicular to both vand w. 


3. (a) Kt) =—-i+(2+t)j-k 
(b) I(t) = (3t — 3)i + (t + 1)j — tk 
(c) —2x +y +2z=9 


5. 10x + 4y — 2z = 26 


7. (a) 0 
(b) 5 
(c) —10 
9. (a) 2/2 
(b) 5/(2V15) 
(c) —10/(V6V17) 


IL {sta+s(1—t)b|0<t <land0<s <1} 


13. Let v= (a1, a2, a3), w= (bı, b2, b3), and apply the 
CBS inequality. 


11.23. 3 3 10 6 
15. AB=|8 18 3}, BA=]1 11 1 
6 8 3 3 18 18 


17. The area is the absolute value of 


ay a2 =| al a2 | 
bı bo) |by+Aa, bz + Aagl” 


(A multiple of one row of a determinant can be added to 
another row without changing its value.) Your sketch 
should show two parallelograms with the same base and 
height. 


19. The cosines of the two parts of the angle are equal, 
because 


a- V/|/all|| vl = (a- b+ |/al||/bi|) /|vil = b- w/l blivi. 


2L The triangle inequality gives 
II vil = ||(w— w + wi] < ||v— wi + |wi], which implies 
Iivi — w| < ||v— wi. A similar argument gives 
Iwi — Ivi] < Iv — wi. 
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ijk 
3 ixj=| 0 0| = k ac 
010 


25. (a) HINT: The length of the projection of the vector 
connecting any pair of points, one on each line, onto 
(a x @)/lla x æl isd. 


(b) v2 


27. (a) Note that 


lt 2 l 1 0 0 
=X% X X3), = X1 X2— X1 X3— X1 
yı Y2 YB yı Y2-Y1 Y3—Y1 


_ 1yx2-x1 oul 
21y2-Y1 y3— yal 


(b) E 
29. (Plot omitted) 


Rectangular 
(a) (v/2/2, 2/2, 1) 
(b) (3.3/2, 3/2, —4) 


Spherical 
(a) (V2, 2/4, 2/4) 
(b) (5, 2/6, arccos (—4/5)) 


(c) (0, 0, 1) (c) (1, 2/4, 0) 
(d) (0, —2, 1) (d) (V5, 32/2, arccos (/5/5)) 
(e) (0, 2, 1) (e) (V5, x/2, arccos (v5/5)) 


BL z =r? cos20; cos = p sin? ¢c0s20 


33. |x- y| = 6 < v38 = |[xi|llyil; x+ yll = v33 < 
V14 + V7 = ||xl| + Iyl 


35. (a) The associative law for matrix multiplication may 
be checked as follows: 


n n n 
[(AB)C]j = S (AB )ixCK = So So ABC, 
k=1 


k=1 |=1 


n 
= XC Ai(BC)i = [A(BC)]j. 


l=] 


Use this with C taken to be a column vector. 


(b) The matrix for the composition is the product 
matrix. 


37. R’ is spanned by the vectors &, @,...,@. If ve R", 


then 


n n 
Tv=T [Eve] =% (v-@)Te. 
i=1 


i=1 
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Let aij = (T & - @), so that Chapter 2 
n 
Te =) aje. Section 2.1 
j=l L (a) Scalar-valued. 
Then (b) Vector-valued. 


a (c) Vector-valued. 

Tv-@ = > (V-@)aķi. 

i=l 3. (a) c = 0 corresponds to (iii) c = 1 corresponds to 
(ii) c = —1 corresponds to (i). 


That is, if 
(b) c= 12 corresponds to (iv) c = 6 corresponds to (v). 
V1 all ++: aln v1 
v=|:|, thn Tv=]: ale 5. (a) 
Un an1 >+ ânn Un 
as desired. 


39. (a) 70cos@ + 20sine 
(b) (21/3 + 6) ft- Ib 


Al. Each side equals 2xy — 7yz + 5z? — 48x + 54y— 
5z — 96. (Or switch the first two columns and then 
subtract the first row from the second.) 


43. Add the last row to the first and subtract it from the 


second. 
45. (a) Scalar. (c) Vector. 
(b) Vector. (d) Scalar. 
47. (a) i 1-1 i 10 
i si Ii JE sh 


i) 1 -l 
ee at | 


(b) A-187! = E E # (AB), 
piyi E | = (AB) 


ay a2 a3 
bi b2 b3 
C1 2 G3 
(b) 1/3 


1 
49. (a) 6 


5L Use the fact that ||al|2 = a- a, expand both sides, and 
use the definition of € 


53. (1/./38)i— (6/./38)j + (1//38)k 
55. (2//5)i- (1//5)j 
57. (/3/2)i+ (1/2V2)j + (1/2/2)k 


c=-2 


7. The level curves and graphs are sketched below. The 
graph in part (c) is a hyperbolic paraboloid like that of 
Example 4, but rotated 45° and vertically compressed by 
a factor of 1/4. To see this, use the variables u = x + y 
and v = x — y. Then z = (v? — u?) /4. 


(a 


Intersection 
with xy plane 


\ 
\ š 
N \ Intersection 
N 1 . 
1 with xz plane 


~ 
e 
~. 
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Section x =y 


9. (a) f(x,y,z) = x2y®— 2z, c=3 
(b) g(x, y) = (xy — 3)/2 


IL For Example 2, z =r(cos@ + sin) + 2, shape depends 
on 6; for Example 3, z = r2, shape is independent of @; 
for Example 4, z = r2(cos? 6 — sin? 6), shape depends 
on ð. 


13. The level curves are circles x? + y? = 100 — c? when 
c < 10. The graph is the upper hemisphere of 
x2 + y? +z? = 100. 


15. The level curves are circles, and the graph is a 
paraboloid of revolution. See Example 3 of this 
section. 


17. ifc = 0, the level curve is the straight line y = —x 
together with the line x = 0. If c 4 0, then 
y = —X +(c/x). The level curve is a hyperbola with the 
y axis and the line y = —x as asymptotes. The graph is 
a hyperbolic paraboloid. Sections along the line y = ax 
are the parabolas z = (1 + a)x2. 


19. ifc > 0, the level surface f(x, y, z) = c is empty. If 
c = 0, the level surface is the point (0, 0, 0). If c < 0, 
the level surface is the sphere of radius ./—c centered at 
(0, 0, 0). A section of the graph determined by z = a is 
given by t = —x? — y? — a2, which is a paraboloid of 
revolution opening down in xyt space. 


2L Ifc <0, the level surface is empty. If c = 0, the level 
surface is the z axis. If c > 0, itis the right-circular 
cylinder x? + y? = c of radius /€, whose axis is the z 
axis. A section of the graph determined by z = a is the 
paraboloid of revolution t = x? + y2. A section 
determined by x = b is a “trough” with parabolic cross 
section t(y, z) = y? + b?. 
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23. Setting u = (x — z)/./2 and v = (x +.z)/+/2 gives the 3L An elliptic paraboloid with axis along the x axis. 
u and v axes rotated 45° around the y axis from the x 
and z axes. Because f = vy4/2, the level surfaces x 


f = c are “cylinders” perpendicular to the vy plane 

(z = —x) whose cross sections are the hyperbolas 

vy = C/4/2, so the section S,_a N graph f is the 
hyperbolic paraboloid t = (z + a)y in yzt space [see 
Exercise 7(c)]. The section Sy_p N graph f is the plane 
t = bx + bz in xzt space. The section Sz=p N graph f 
is the hyperbolic paraboloid t = y(x + b) in xyt space. 


9y? + 4z? = 144 


25. Ifc < 0, the level curve is empty. If c = 0, the level 
curve is the x axis. If c > 0, itis the pair of parallel lines 
ly| = c. The sections of graph with x constant are 
V-shaped curves z = |y| in yz space. The graph is 
shown in the accompanying figure. 


Zz 


33. The value of y does not matter, so we get a “cylinder” of 


parabolic cross section. 
—> y 


(Exercise 25) z 


27. The value of z does not matter, so we get a “cylinder” of 
elliptic cross section parallel to the z axis and 
intersecting the xy plane in the ellipse 4x? + y? = 16. 


35. This is a saddle surface similar to that of Example 4, but 
the hyperbolas, which are level curves, no longer have 
perpendicular asymptotes. 


Eg y eS, À A 
x Lai 
aS 1o 
aN 1o 
Ss 2Vid A 
ss ” 
s5 Ka 
s yg- = =-s7,¢ 
es a 
oS - ie 
1 Sse? A 

1 ae x 


29. The value of x does not matter, so we get a “cylinder” 
parallel to the x axis of hyberbolic cross section Level curves 
intersecting the yz plane in the hyperbola z? — y? = 4. 
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Section 2.2 
L Nothing. 
3. (a) 0 (b) —1/2 (c) 1 
5. (a) 5 (b) 0 (c) 2x 
7. €3/10 

37. A double cone with axis along the y axis and elliptical 9. (a) 0 (b) —1/2 (c) 0 


cross sections. 
F4 


IL (a) Compose f(x, y) = xy with g(t) = (sint) /t for 
3 =4x? +222 t = 0 and g(0) = 1. 


6 =2x2 +722 — (b) 0 (c) 0 
Y B (a) 1 
(b) [Xo 
(c) (1, e) 
x (d) Limit doesn’t exist (look at the limits for x = 0 and 


y = 0 separately). 


39. Complete the square to get 
(x +2)? + (y —b/2)? + (z+ 9)? = (b? + 4b +97) /4. 
This is an ellipsoid with center at (—2, b/2, —3) and 17. 0 
axes parallel to the coordinate axes. " 


15. Everywhere except (0, 0). 


19. If (xo, yo) € A, then |xọ] < 1 and |yo| < 1. Let 
r = min(1 — |xo], 1 — |yo|). Prove that D; (xo, yo) C A 
either analytically or by drawing a figure. 


AL Level curves are described by cos 20 = cr2. If c > 0, 
then —7/4 < 0 <7/40r37/4 <6 < 57/4. If c <0, 
then 7/4 < 0 < 32/4 or 5x/4 < 0 < 77/4. In either 


case you get a figure-eight shape, called a lemniscate, 2L Letr = min(2 — /x? a ya, /x? a ya =a). 
through the origin. (Such shapes were first studied by 
Jacques Bernoulli and are sometimes called B ernoulli’s 23. Use parts (ii) and (iii) of Theorem 4. 
lemniscates. ) . 
y 25. (a) Let the value of the function be 1 at (0, 0). 
(b) No. 
zl. 
e] c<0 27. For |x — 2| < ô= /e +4 -— 2, we have 


|x? — 4] = |x —2|]x + 2| < 5(8 + 4) =e. By 
Theorem 3(iii), limity_,2 x2 = (limity_,2 X)? = 22? = 4. 


B 


Letr = ||x— y|//2. If |Iz— yl <r, let 
f(3 = ||Z—y\/r. If z- yli >r, let f(3 =1. 


3L (a) limity_.p+ f(x) = L if for every « > 0 there is a 
ô > 0 such that x > b and 0 < x — b < dS imply 
If(x)—- L] <e. 
(b) limity_,9- (1/x) = —oo, limitt—oo € = 0, and so 
limity_,9- e!/* = 0. Hence 
limity_,9- 1/(1 + e!/*) = 1. The other limit is 0. 


33. If > 0 and % are given, let 5 = (e/K )}/. Then 
LFO) — f (2%) |] < K 6% = e whenever ||x— X|| < ô. 
Notice that the choice of 5 does not depend on x. This 
means that f is uniformly continuous. 


35. (a) Choose 5 < 1/500. 
(b) Choose é < 0.002. 


Section 2.3 


L (a) af/ax = y; af /ay =x 
(b) af/ax = ye*Y; af /ay = xe” 
(c) af/ax = cosx cos y — x sin x cos y; 
af /ay = —x cos x sin y 
(d) af/əx = 2x[1 + log (x? + y?)]; 
af /ay = 2y[1 + log (x? + y?)]; (x, y) # (0, 0) 


3. (a) aw/ax = (1+ 2x?) exp (x? + y?); 

aw /ay = 2xy exp (x? + y?) 

(b) aw /ax = —4xy?/(x? — y2)?; 
aw /ay = 4yx2/(x? — y?)? 

(c) aw/ax = yeXY log (x? + y?) + 2xeXY /(x2 + y?); 
aw /ay = xe log (x? + y?) + 2yeXY /(x? + y?) 

(d) aw/ax = 1/y; aw /ay = —x/y? 

(e) aw/ax = —y2e*Y sin yeXY sin x + cos ye* cosx; 
aw /ay = (xyexY + eXY)(—sin yexY sin x) 


5. z = 6x +3y—11 


7.z=x-y+1 


aa fy 


eY xeY —siny 
(b) i 0 | 
1 ey 
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(ytxy2jexY (x + x2y)eXy 
(d) sin y x cosy 
5y2 10xy 
IL z =2(x —4) + 4y — 3) +10 
1B. Atz=1. 
15. Both are xye*Y. 
T7. (2, 6, —36) 


19, (a) vf = (ey -7 (—2x? +1), 
ye =y? =z, hee ey 


(b) Vf = (x2 + y? +.22)-2(yz2( y? +z? — x2), 
xz(x2 +z? — y2), xy(x2 + y? — z?)) 

(c) Vf = (z?e* cosy, —z2e* sin y, 2ze* cos y) 

2L 2x+6y-z=5 

23. I(t) = (1, 3, 20) + t(—1, 2, 8) 

25. —2k 

27. They are constant. Show that the derivative is the zero 
matrix. 

Section 2.4 


L This curve is the ellipse ( y2/16) + x? = 1: 
y 


3. This curve is the straight line through (—1, 2, 0) with 
direction (2, 1, 1): 
Z 
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5. (a) dt) = (2cos(t), 2sin(t)), t € [0, 27] 13. —2cost sinte!"t + sintt + cos? teSnt — 3 cos? t sin? t 
(b) et) = (2sin(t), 2cos(t)), t € [0, 27] for both (a) and (b). 
(c) qt) =(2cos(t) +4, 2sin(t) + 7), t € [0, 27] 15. (2, 0) 


7. 61+ 6tj+ 3t’k 17. (a) h(x, y) = f(x, u(x, y)) = f( p(x), u(x, y)). We 
9. (—2cost sint, 3 — 3t2, 1) use p here solely as notation: p(x) = x. 


—(at _¢ 
eh cele) ean əh afdp af au af . af au 


Written out: = = 4 
13. c(t) = (t cost + sint, 4) əx apdx dudx ap au ax 
15. Horizontal when t = (R/v)nz, n an integer; speed is because ap = ox =1 
zero if n is even; speed is 2v if n is odd. dx dx 


17. (sin3, cos3, 2) + (3cos3, —3sin3, 5)(t — 1) JUSTIFICATION: Call ( p, u) the variables of f. To 


use the chain rule we must express h as a 
set aie composition of functions; that is, first find g such 
2L (8, 0,1) that h(x, y) = f (g(x, y)). Let g(x, y) = (p(x), 


u(x, y)). Therefore, Dh = (Df )(Dg). Then 


la ay | 


23. (a) /1+ 64r? 
(b) Yes, whent = 0. 
(c) Kt) = (1, 0, 1672) + t(0, 1, 87) 


əh əh _ rəf of 
(0) (1, 27,0 foe ay tap a be aA 
ðX ð 
25. (a) (f o o(t) = (tê — t4, 2t5, 4t2) 1 6 : 
(b) Kt) = (0, 2, 4) + t(2, 10, 8) -T3 a fa n 
ap all> = 
f ax ay 
Section 2.5 əf af au af au 
=h | 
L Use parts (i), (ii), and (iii) of Theorem 10. The derivative Ldp ðU ax du dy 
at xis 2( f(x) + 1)Df (x). 
3. Compute each in two ways; the answers are əh əf sof au 
and so — = — + — —. You may see 
(a) (f 0€)/(t) =et(cost — sint) ox op- eet 
(b) (F o @'(t) = 15t4 exp (3t5) ah af oF au as an answer. This requires 


ax ax du ax 

careful interpretation because of possible ambiguity 
about the meaning of of /ax, which is why thename 
p was used. 


əh əf əfd əf d 
7. (fog)(x, y) = (tan (e*-¥—1)—e*-Y, e?*-Y) —(x—y)?) (b) = 4 


0 0 əx dx  oudx  dvdx 
and {FogL=[> 9) g oh _ of au, af av, af aw 
əx auax  dvax daw dx 


(c) (f o@/(t) = (e? — e-*)[1 + log (e?! + e7%)] 
(d) (fo €!(t) = (1+ 4t2) exp (2t2) 


5. Use Theorem 10(iii) and replace matrices by vectors. 


9. 1 os(1) cos (log V2) 


2 19. (a) G(x, y(x)) =0 andso A 
IL (a) plt) = (3sin(t) + 2, 1, cos(t) + t?), ox- Oy AX 
P(x) = (—3, 0, 27) dy aG1 əG17 7! pay 
(b) dx) =(-1,0, x), c(x) = (0, —1, 1), 5 “dx. dy1 əy2 ax 
03 0 D lay. | ~~ | aG2 aG2] | aG2 
Df(-1,0,7)=|-2 0 0 tx ayı at Lax 
1 0 2x 


(c) (—3, 0, 2x) where ~1 means the inverse matrix. 


The first component of this equation reads 


0G10G2 40G20Gj 
dyi əx əy2 əx ə3y2 
dx 3G1əðG2 9G2 dG, ` 
dy1 əy2  əðyı 3y2 
dy —2x 
(= 3y? + ey 


21 Apply the chain rule to 3G /aT , where G(t(T, P), 
p(T, P), V(T,P)) =P(V —b)e?/RYT _ RT is 
identically 0; t(T, P) =T; and p(T, P) =P. 

23. Define Ri(h) = f(x + h) — f(x) — [Df (20) ]h 

25. Let gi and g2 beC 1 functions from R3 to R such that 
gi (x) =1 for IIx] < V2/3; 9109 =0 for |x] > 22/3; 


g2(x) = 1 for |x— (1, 1, 0)|| < 2/3; and g2(x) = 0 
for ||x— (1, 1, 0)|| > 22/3. (See Exercise 24.) Let 


1 0 O07 [x1 1 
0 0 Of [x3 0 


0 0 -11 [x1 
and h2(x) = f 0 4 fe ; 
0 0 1J Lx 


and put f(x) = gi(x)hi(x) + 92h29. 


27. Proof of rule (iii) follows: 


h(x) — h(x) — [ f (xo) Dg (x0) + 9 (Xo) Df (x) ](x— X)| 


|x — xoll 
1g(x) — g(%) — Dg(%)(x— xo)| 
\|X— Xo|| 
| f(x) — f(x) — Df (X0)(x— x)| 
|X — Xo || 
| f(x) — f(xo)| 190) — 9 (%)| 
|X— x| Ix- XI 


< | f(Xo)| 


+ |9(%)| 


|X— X) ||. 


As X—> %, the first two terms go to 0 by the 
differentiability of f and g. The third does so because 
f(x) — f(x)1/1x— x|] and |g (x) — g(x) 1/11x— xoll 
are bounded by a constant, say M , on some ball D, (xo). 
To see this, chooser small enough that [ f (x) — f (xo)]/ 
|X — Xo|| is within 1 of Df (xo)(x— x) /I|X — xo || if 
|X— || <r. Then we have | f(x) — f (%9)|/|]K— ol] < 
1+ |Df (%)(x— %)[/I.X— |] = 1+] VF (x) - (X—%)|/ 
|X — Xo|| < 1+ || Vf (xX) || by the Cauchy- Schwarz 
inequality. 

The proof of rule (iv) follows from rule (iii) and the 
special case of the quotient rule, with f identically 1; 
that is, D(1/g)(xo) = [-1/g(%)?]Dg (%9). To obtain 


Answers to Odd-Numbered Exercises 505 


this answer, note that on some small ball 
Dr (Xo), g(x) >m > 0. Use the triangle inequality and 
the Schwarz inequality to show that 


1 1 1 
Jog 9) gh)? 
|X — X || 
— 2 21900 = 906) — Dg(x)(x- >) | 
= 1900] 190%) | |K— Xo || 
19(%) — g(Xo)| |Dg(%)(x— x)| 
1g(x)|g(x0)4 Ix- ll 
< 2 1909 = 90%) — Dg(%)(x-— X)| 
Tm? Ix- x |) 


IV g(%) I 
oo 


Dg(Xo)(x— X) 


19(x) — 9(%)|. 


These last two terms both go to 0, because g is 
differentiable and continuous. 


29. First find formula for (8/ax)(F (x, x)) using the chain 
rule. Let F (x, z) = fy f(z, y)dy and use the 
fundamental theorem of calculus. 


3L By Exercise 28 and Theorem 10(iii) (Exercise 27), each 
component of k is differentiable and Dki (xo) = f (xo) 
Dg; (x0) + gi (%9) Df (%9). Because [Dgj (Xo) ly is the ith 
component of [Dg(Xo) ly and [Df (xo) lyis the number 
Vf (xo) - y, we get [Dk(xo) ly = f (x9)[Dg(x) ly + 
[Df (0) lylg (xo )] = f (xo)[Dg (xo) ly + [Vf (x0) - ylg (xo). 


33. 4 


35. Let g(x, y) =x — y, so that z = f og. Then the chain 
. ., Oz df dg əf 
rules implies — = — — = — and 
əx agax ag 
əz of ag__iaf 
əy dgady ag’ 


Section 2.6 
L vf(1,1, 2)-v= (4, 3, 4) -(1/V5, 2/5, 0) = 2/5 


3. (a) 17e°/13 
(b) e//3 
(c) 0 


5. (a) Since V f (xo) points in the direction of fastest 
increase (Theorem 13), the maximum value of the 


directional derivative is Df — = 
BATI 
V f(x) 


v f(x): IVf) = |V f (%0) ||. 
(b) 21/2 
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7. 9//29 

9. (a) z+ 9x =6y -6 
(b) z +y = x/2 
(c) z=1 


1 

a oe 

IL (a) z3 +K 
(b) 21+ 2j+2k 


(c) -2i+j+h 


15. The graph of f is the level surface 
0 =F (x,y,z) = f(x, y) — z. Therefore, the tangent 
plane is given by 


0 = VF (Xo, Yo, Zo) (xX — Xo, Y — Yo, Z — Zo) 


of of 
= (žo yo), By (Xo yo), -1) 


-< (X — Xo, Y — Yo. Z — Zo). 


Because zo = f (Xo, yo), thisisz = f (Xo, Yo) + 
(df /3x) (xo, Yo)(X — xo) + (OF /dy)(Xo, Yo)(y — Yo). 


17. (a) Vf =(z + y,z + x,X + y), 
g(t) = (et, — sint, cost), 
(f og'(1) =2ecos1 + cos? 1 — sin? 1 
(b) Vf = (yzeX¥?, xzeX¥?, xyeX¥2), g(t) = (6, 6t, 32), 
(f og/(1) = 108e18 


(c) VF =[1+log(x? + y? +z?) (xi+ yj + zk), g = 
(et, -et 1), 
(f og/(1) =[1+log(e? +e-2 +1) ](e? -e-2 +1) 


19. (a) (0, 0) 
(b) V (0,0) = (—4x, —6y)|(o,0) = (0, 0) 
2L Let f(x, y, zZ) = 1/r = (x2 + y? 4.22) -1/2; 
r= (x, y, z). Then we calculate Vf = — 
(x? + y? +22)-3/2(x, y, z2) = —(1/r3)r. 


23. vf = (g'(x), 0) 


Di 


Df(0,0,...,0) =[0,..., 0] 


27. œ = [—(0.03 + 2byı)/2a]i + yij, db = 
[—(0.03 + 2by2) /2a]i+ y2j, where yı and y2 are the 


2 
solutions of (a? + b2)y? + 0.03by + (= E a?) =0. 


3L Crosses at (2, 2, 0), 5/10 seconds later. 


Review Exercises for Chapter 2 


L (a) Elliptic paraboloid. 


(b) Lety’ = y + 3 and writez = xy’. This is a (shifted) 
hyperbolic paraboloid. 


2xy x2 | 


3. (a) Df (x, y) = [yey —xeT Y 


(b) Df (x) = i 


(c) Df(x, y,z)] =[e% e” e] 


10 0 
(d) Df(x, y,z) = F 1 j 
0°-0 -1 

0 0 0 

5. f —2n | 


-12 28 

7. -s J 
-4 -5 

9. (0, 257, 0) 


ITL The plane tangent to a sphere at (Xo, Yo, Zo) is normal to 
the line from the center to (Xo, Yo, Zo). 


13. (a) z=x-—y+2 
(b) z= 4x — 8y —8 
(c) X+y+z=-1 
(d) 10x + 6y — 4z = 6- x 
(e) 2z = V2x + /2y 
(f) x+2y-z=2 


15. (a) The level curves are hyperbolas xy = 1/c: 


(b) c=x xy- y? 


1+5 1- v5 
IEn Cara 


T7. (a) 0 


(b) Limit does not exist. 


19. (1 + 2x2) exp(1+ x2 + y?) 


40 
2L —e-" 
V5 


23. (a) Theline L(t) = (xo, yo, f (xo, yo)) + t(a, b, c) lies 
in the plane z = f (Xo, yo) if c = 0 and is per- 
pendicular to Vf (Xo, yo) if a(af /9xX)(Xo, Yo) + 
b(db/dy)(Xo, Yo) = 0. On L, we have 


f f 
f (Xo, Yo) + [Eo yo)| (x — Xo) + [Fo yo)| (y — yo) 


f f 
= F (Xo, yo) +at| lo, ¥0)| +o 000 ¥0)| 
= f(Xo, Yo) =Z. 


Therefore, L lies in the tangent plane. An upward 
unit normal to the tangent plane is p= (1 + || Vf 71⁄2 
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(—(af/əx)(xo, Yo), —(af /day)(Xo, Yo), 1). Therefore, 
cosé = p- k= (1+ ||Vf||2)~2/2, and tan = sin 8 / 
coso = {Vf 12/1 + IVE I2)}4/2 (1+ IVE 12) 712 
= ||Vf || as claimed. 

The tangent plane contains the horizontal line 
through (1, 0, 2) perpendicular to 

vf (1, 0) = (5, 0), that is, parallel to the y axis. It 
makes an angle of arctan (|| Vf (1, 0) ||) = 

arctan 5 ~ 78.7° with respect to the xy plane. 


25. (1//2, 1//2) or (—1/V2, -1//2) 


27. A unit normal is (./2/10)(3, 5, 4). The tangent plane is 
3x + 5y + 4z = 18. 


(b 


29. 4i+16j 


3L (a) Because g is the composition à + àx => f (àx), 
the chain rule gives 


X1 
g'(à) = Df (Ax) 
Xn 
Thus, 
X1 
g'(1) = Df (x) = Vf(x)-x 
Xn 


But also g(a) = AP f(x), so g'(à) = paP-! f (x) 
and g’(1) = pf (x). 
(b) p=1 


33. Differentiate directly using the chain rule, or use 
Exercise 31(a) with p = 0. 


35. (a) If (x, y) # (0, 0), then we calculate for (i) that 

af /ax = (y? — yx2)/(x2 + y2)? and af /ay = 

(x3 — xy?) /(x? + y2)2. If x = y = 0, use the 
definition directly to find that both partial derivatives 
are 0. For (ii), if (x, y) 4 (0, 0), then of /ax = 2xy6/ 
(x? + y4)2 and af /ay = (2x4y — 2x2y9)/ 

(x2 + y4)2. The partials at the origin are zero. 

The function (i) is not continuous at (0, 0); the 
function (ii) is differentiable, but the derivative is 

not continuous. 


(b 


37. (a) V2x/8 
(b) — sin v2 
(c) —2./2e-2 
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39. (—4e-!, 0) 


4L (a) SeeTheorem 11. 
(b) 
g(u) = (sin 3u)? + cos 8u 


g’(u) = 6sin 3u cos 3u — 8 sin 8u 
g'(0)=0 


Vf = (2x, 1) 


g/(0) = Vf (K(0)) - (0) 
= (0, 1)-(3,0) =0 


43. t = /14(-3 + 359) /70 = (—3 + V359) /5V T4 


45. az/ax = 4(e- 2X29 +247) (1 + y) /(e7 2X29 — @2xy)2 
az/dy = 4(e-2¥-2X+2XxY)(1 + x) /(e-2X—-2Y — @2xy)2 


47. Notice that y = x2, so that if y is constant, x cannot be a 
variable. 


49. [f’(t)g(t) + f(t)g"(t)]expl f(t)g(t)] 


5L di f(dt))]/dt = 2t/[(1 + t? + 2cos?t)(2 — 2t? + t*)] 
—At(t? — 1) In(1 + t? + 2 cos? t) /(2 — 2t? + t4)? 
—4 cost sint/[(1 +t? + 2 cos? t)(2 — 2t? + t4)] 


53. Letx = f(t), y = t, and use the chain rule to 
differentiate u(x, y) with respect to t. 


55. (a) n=PV/RT;P =nRT/V; 
T =PV/nR;V =nRT/P. 
(b) aV/aT =nR/P;aT /aP =V/nR; 
aP /aV = —nRT/V2. 
Multiply, remembering that PV =nRT. 


g 


We can solve for any of the variables in terms of the 
other two. 


(b) aT /aP = (V — 8)/R; 

aP /aV =—RT/(V — 8)? + 2a/V 3; 

ƏV /aT = R/[(V — RT /(V — £)? — 2a /V 3)] 
(c) Multiply and cancel factors. 


59. (a) (1/./2, 1/2) 
(b) The directional derivative is 0 in the direction 


(xoi+ yoj)/ v xé + yê. 


(c) The level curve through (xo, yo) must be tangent to 
the line through (0, 0) and (xo, yo). The level curves 


are lines or half-lines emanating from the origin. 


6L G(x,y)=x-—y 


vf(h(0)) = V f (0, 1) = (0, 1) 
hH(u) = (3 cos 3u, —8 sin 8u) 


T17. 


Chapter 3 
Section 3.1 
2 Jea 242 _y3 3 
t ea H o EIER 
OX Gay)? ay? (x2 + y2)4 
af af — —6x* +36x?y? — 6y# 
axoy  ayax | (x2 + y2)4 
af 4 2 
aya —y" cos(xy*), 
at —2x sin (xy?) — 4x?y? cos (xy?) 
əy? ' 
a ya 2y sin(xy*) — 2xy? cos(xy*) 
5 a?f  2(cos? x + eY) cos2x + 2sin? 2x 
ax2 (cos? x + e-Y)3 ' 
af eY — cos? x 
ay? e¥(cos¢ x + e-Y)3 
f tf 2 sin 2x 
axay  ayax  eY(cos? x +e-Y)3 
7. (a) fx = -1, fy =-T, Txx = 0, fyy => 0, 


fxy = fyx = —1 
(b) fx = 5, fy = —18, fxx = 2 fyy = 42, 
fxy = fyx = —8 


a? f 
ate 


(c) All second partial derivatives are zero. 
No. 
a2 f a? f 
2 = 2G = 
(a) c a= c4 sin(x — ct) = a2 
a? f a? f 
2075 phe ; Z 
(b) c 2 c* sin(x) sin(ct) ay. 
a’ 2 2 
(c) Ca = 30c?(x —ct)* + 30c2(x + ct)4 = = 
(a) a2z/ax2 = 6, a2z/ay? = 4, 
a2z/x Əy = a2z/dy ax = 0 
(b) a2z/ax? = 0, a2z/ay2 = 4x/3y3, 


a2z/dx dy = 3?z/ðy ax = —2/3y2 


fyy = 2x +2y, fyz = 22, fzx = 0, fxyz =0 


Because f and af /az are both of class C 2, we have 


dx dyaz  axdy az ayax az ay \ ax az 


8 fot \ æf 
~ ay \azax } ay azax’ 


af a2 əf a2 əf (a 


19. fyow = fzwx = e%¥2[2xy cos(xw) + 
x2y2z cos(xw) — x2yw sin(xw)] 


of X xy 

2L (a) 3x = arctan y + x+y? 
af x? 
ay ay? 
af ys af xy 
Ox? (x2 + y4)2" ay? (x2 + y?)? 
af af —2xy? 


əxəy ayax (x+y?) 


(b) af _—xsin yx? +y? af _ -ysin yx? + y? 
ax [x24 yr?" ay /y2 4 y? 
af x?sin/x2 +y? x? cos y/x? + y? 
ax (x2 yA x+y 
sin \/x2 + y? 
BEETH 
af  y?sin/x2 +y? -y? cos \/x2 + y? 
sin \/x2 + y? 
+y?) 
af af 
ax ay ay ax 
o fsiny/x2 +y? — cos yx? +y? 
(c) r = —2x exp (—x? — y?), 
of _ 3 2 
Ti 2y exp(—x* — y*), 
af 2 x2 
ye ae 2) exp(—x* — y^), 
af = (4y? — 2) exp (—x? — y?) 
ae p ver 
af af ee 
ax oy ayax T PY OPI y) 
a?f /dx\2 af dxdy af sdyy? 
23. xz (dt) + 2 ay at tt at lat) 
af ax | af dèy 
ax dt? ay dt?’ 


where qt) = (x(t), y(t)) 


25. Evaluate the derivatives 92u /ax2 and a2u/ay2 and add. 


Answers to Odd-Numbered Exercises 509 


27. (a) The first function is harmonic, the second is not. 
(b) Any polynomial of degree 1 or 0 is harmonic. 


29. (a) Evaluate the derivatives and compare. 


(b) 
$ 


N x 
x=t 


3L V =—GmM /r = —GmM (x2 + y? + 22)-1/2, Check 
that 


vV əV əv 
= GmM x2 2 72 —3/2 
ae ae Hoz (x+y +2*) 


[3 — 3(x2 + y? + z?) (x2 + y? + z2?)71] = 0. 


Section 3.2 


L (a) f(hı,h2) =1 +h +h2 + R1ı(0 h) 
(b) f (hy, hz) =1+h1+h2+ $h? +hıh2 + 4h + 
R2(0 h) 


3. f(hy, h2) =h? +2hih2+h2 [R2(0 h) = 0, in this case]. 


h2 h2 
5, F(ha, h2) =1+h1 +h2+ => +hihg +> + R2(0 h) 


7. f(hy, h2) = 1+h1h2 + R2(0 h) 


1 2 1 mT 2 
9 9(x, y) = -1+ 31x- n) +50 >) 


9-9-3 (0-9) 


2 
bd 2 Tx d 
+Z- -Z (x-3) 


IL p(x, y)=2 


13. (a) Show that |Rx(x, a)| < AB‘+!/(k + 1)! for 
constants A, B, and x in a fixed interval [a, b]. 
Prove that Rk + Oask — oo. (Use convergence 
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of the series X` ck 7k! = e€ and use Taylor's 
theorem.) 


The only possible trouble is at x = 0. Use 
L'H ôpital’s rule to show that 


(b 


limit p(t)et = 00 
too 


for every polynomial p(t). Using this, establish that 
limit, _, 9+ p(x)e7t/x = 0 for every rational function 
p(x), and conclude that f*)(0) = 0 for every k. 


(c) f:IR" — Ris analytic at x if the series 


a 


DSN le hil 
mr 2 ax ta A ax ax) o E 


LJ= 


converges to f (Xo + h) for all h= (hi, ..., hp») in 
some sufficiently small disc ||hl| < £. The function 
f is analytic if for every R > 0 there is a constant 
M such that |(a* f/axi, --- Əxi) OQ] < M K for each 
kth-order derivative at every x satisfying ||x|| < R. 


(d) f(x,y) =1+x+y+ 4b? + ays yee. 
k 


1 KX. ep 
— jyk-J 4... 
a a 
j=0 
Section 3.3 

L (0, 0); saddle point. 

3. The critical points are on the line y = —x; they are local 
minima, because f(x, y) = (x + y)? > 0, equaling zero 
only when x = —y. 

5. (0, 0); saddle point. 


1 1 Pan 
7. (—j.-j) toc minimum. 


9. (0, 0); local maximum. (The tests fail, but use the fact 
that cosz < 1.) 


(VTZ, /x/2), local minimum 


(0, /7), local minimum. 
IL No critical points. 
13. (1,1) is alocal minimum. 


15. (0, nz); critical points, no local maxima or minima. 


17. Local minimum at (1, 1). 


19. Local maximum at (0, 3), local minimum at (1, 2), 
saddles at (0, 2) and (1, 3). 


2L Minimum at (0, 0) and maxima at (0, +1) [and saddles 
at (+1, 0)]. 


23. (a) əf/əx and df /ðy vanish at (0, 0). 
(b) Show that f(g(t)) = 0 att = 0 and that 
F(g(t)) > O if It] < |b|/3a2. 
(c) f is negative on the parabola y = 2x2. 


25. The critical points are on the line y = x and they are 
local minima (see Exercise 1). 


27. Saddle. 


29. Minimize S = 2xy + 2yz + 2xz with z = V /xy, V the 
constant volume. 


3L 40, 40, 40 


33. (a) Vf(0, 0) = (6x° + 2x, 6y?)|(0,0) = (0, 0), so f 
has a critical point at (0,0). The Hessian matrix of f 


_ |30x44+2 0 _ {2 0 
at (0,0) is È wl, r = f 0 


has determinant zero. Similarly for g and h. 


f has a local minimum at (0,0) since f (0, 0) = 0 
and f(x, y) > 0 for all other (x, y). g has a local 
maximum at (0,0) since g(0, 0) = 0 and 

g(x, y) < 0 for all other (x, y). h has a saddle at 
(0,0) since h(0, y) > 0, but h(x, 0) < 0 for x near 
zero. This shows that there are points arbitrarily 
close to the origin on which h takes both positive 
and negative values. 


| hie 


(b 


35. The only critical point is (0, 0, 0). Itis a minimum, 
because 


x2 +y? 1 
f(x,y,z) > “try = sy ez > 0. 


f ; 27\. na 
37. (. z) is a saddle point; (5, z) isa local minimum. 
3: , , 
39. = is the absolute maximum and 0 is the absolute 
minimum. 
4L —2isthe absolute minimum; 2 is the absolute maximum. 


43. Absolute maximum of 3 at (—1, 2), absolute minimum 
of 0 at (—2, 5) and (2, 1). 


45. (5.4) is a local minimum. 


47. If un(x, y) =u(x, y) + (1/n)e%, then V?un = 
(1/n)e* > 0. Thus, Un is strictly subharmonic and can 
have its maximum only on 3D , say, at ph = (Xn, Yn). If 
(Xo, Yo) € D, check that this implies u(Xn, Yn) > 
u(xo, Yo) — e/n. Thus, there must be a point q = (Xoo, Yoo) 
on aD such that arbitrarily close to qwe can find an 
(Xn, Yn) for n as large as we like. Conclude from the 
continuity of u that U(Xoo, Yoo) = u(xo, Yo). 


49. Follow the methods of Exercise 47. 


5L (a) If there werean xı with f(x1) < f (xo), then the 
maximum of f on the interval between xo and x 
would be another critical point. 


(b) Verify (i) by the second derivative test; for (ii), f 
goes to —co as y > œo and x = —y. 


Section 3.4 


L (a) Maximum of 3, minimum of 1. 
(b) Maximum of 3, minimum of 0. 


3. Maximum at yi —1, 1), minimum at 


TE 1, -1). 


5. Maximum at (v3, 0), minimum at (—v3, 0). 


oe a aoe ae 4) 

J70' 70 vT’ vT 

9, The minimum value 4 is attained at (0, 2). U se a 
geometric picture rather than Lagrange multipliers. 


7. Maximum at ( ), minimum at ( 


IL (0,0, 2) isaminimum of f. 


13. 3 is the absolute maximum and 0 is the absolute 
minimum. 


15. (a) Saddle at (0,0). 


(b) Maximum of 14 at (3, 1), minimum of —14 at 
(—3, —1). 


I7. Maximum of 1/33, minimum of —1/3V3. 


19, (a) 3, 3,3 
(b) 9,9,9 
2L The diameter should equal the height, 20/./2z cm. 


23. Maximum value /3 at (+ 1 — 4.) and minimum 


V3'V3' VB 
value —/3 at ( re A 5) 
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25. Horizontal length is ./qA/p, vertical length is /pA/q. 


27. For Exercise 3, the bordered Hessians required are 


0 2x 2y 
|H2} =|2x -2a 0 | =8a(x?+y%), 
2y 0 -2 


0 2x 2y 22 


Hap = [2% 24 8 Mea 4 y2 4-22), 


At gau —1, 1) the Lagrange multiplier is 4 = v/6/4 > 0, 
indicating a maximum at fl. —1,1), anda = 


—4/6/4 < 0 indicates a minimum at TEEI 1, —1).In 


Exercise 7, |H | = 24a(4x2 + 6y2), and so à = /70/12 > 0 
indicates a maximum at (9/./70, 4/./70) and à = —/70/ 
12 < 0 indicates a minimum at (—9/./70, —4/./70). 


29. 11,664 in? 


3L (a) Vf(x) = Ax 
(b) S is defined by the constraint function 
g(x) = x? + x5 + x2 — 1. Because Vg(x) = 2xis 
not Q Theorem 9 applies. At an xwhere f is 
extreme, there is a 4/2 such that 
VE (x) = (A/2)/Vg(x). That is, Ax= Ax 


33. Minimum is (—1//2, 0), maximum is (4, +/7/8), 
local minimum at (1/4/2, 0). 


35. No critical points; no maximum or minimum. 
37. (—1, 0, 1) 


39. The point (K, L) = («B /q, (1 — æ)B /p) optimizes the 
profit. 


Section 3.5 


L Let F(x, y,z) =x + y —z+cos(xyz). Then 
(aF /dz)(0, 0,0) = —1 40. The implicit function 
theorem then says that we can solve for z = g(x, y). 
(dg/dx)(0, 0) = (dg/dy)(0, 0) = 1. 


3. (a) Ifx < -}, we can solve for y in terms of x using 
the quadratic formula. 
(b) aF /ay = 2y + 1 is nonzero for {y | y < —4} and 
{y |> —}}. These regions correspond to the upper 
and lower halves of a horizontal parabola with 
vertex at (— f, —}) and to the choice of sign in the 
quadratic formula. The derivative 
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5. 


17. 
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dy/dx = —3/(2y + 1) is negative on the top half of 
the parabola, positive on the bottom. 


Use the special implicit function theorem with n = 1. 
(See Example 1.) Line (i) is given by 

0 = (x — Xo, y — Yo) + VF (xo, Yo) = 

(x — xo)(ƏF /əx)(xo, Yo) + (y — yo)(ƏF /dy)(Xo, Yo). 
For line (ii), Theorem 11 gives dy/dx = 

—(aF /əx)/(aF /ay), and so the lines agree and are 
given by 


(OF /Əx)(Xo, Yo) 
(dF /dy)(Xo, Yo) 


y= Yo 


, Let F(x, y, Z) =x3z2 —z3yx; 


aF /az = 2x3z — 3z?yx # 0 at (1, 1, 1). Near the 
origin, with x = y + 0, we get solutions z = 0 and 
z = x, and so there is no unique solution. At (1, 1), 
dz/0X = 2 and 0z/ay = —1. 


With Fi = y +x +uv and F2 = uxy + v, the 
determinant in the general implicit function theorem is 


dF 1/au 
aF2/du 


OF1/dv 


OF2/dv Ys 


which is 0 at (0, 0, 0, 0). Thus, the implicit function 
theorem does not apply. If we try directly, we find that 
v = —uxy, So x + y = u2xy. For a particular choice of 
(x, y) near (0, 0), either there are no solutions for (u, v) 
or else there are two. 


No. f(x, y) = (—1, 0) has infinitely many solutions, 
namely, (x, y) = (0, y) for any y. 


(a) x2 + y2 #0 
(b) f(z) = —2(x + 2y)/(x? + y?); 
g’(z) = z(y — 2x) /(x? + y?) 


Solve the equation x? — y? = 0 for x. Then C is the 
graph of f(y) = y2/3, 

Fx(0, 0) = 3x2|o,0 = 0. No; it contradicts the 
converse of the implicit function theorem. 


(a 


(b 


Let Fy =X? — y? — u3 40244, F2 = 

2xy + y? — 2u2 + 3v4 + 8. Compute (ðF1/3u) 
(2, —1, 2, 1) = —12, (dF1/dv)(2, —1, 2,1) = 2, 
(aF2/3u)(2, —1, 2, 1) = —8, (ƏF 2/əv)(2, —1, 
2,1) = 12. Then A = —128 # 0, so the implicit 
function theorem says we can solve for u and v as 
functions of x and y near (2, —1, 2, 1). 


(a 


M ultiply and equate coefficients to get ao, a1, and a2 as 
functions of r1, r2, and r3. Then compute the J acobian 


determinant d(a0, a1, a2) /A(r1, r2, r3) = (r3 — r2) 

(rı —r2)(ry — r3). This is not zero if the roots are 
distinct. Thus, the inverse function theorem shows that 
the roots may be found as functions of the coefficients in 
some neighborhood of any point at which the roots are 
distinct. That is, if the roots r1, r2, r3 of x3 + ax? + 
a1X + ao are all different, then there are neighborhoods 
V of (ri, r2,r3) and W of (ag, a1, a2) such that the roots 
in V are smooth functions of the coefficients in W . 


Review Exercises for Chapter 3 


L 


Let g(x, y) = y — kx, so u = f og. Then 
(əu/əx) = (af /əðg)(əg/əx) = (af /əðg)(—k) and 
(du/ay) = (df /Əg)(əg/3y) = (df /g). The result 
follows. 


=( 
= ( 


3. Saddle at (0,0). 


5, p(x, y) = e7} — 2e- Nx — 1) + 2e Ny — 1) + 


9. 


IL 


e-'(x — 1)? — 4e-}(x — 1)(y — 1) +e7}{y — 1)? 


. (a) Saddle point. 


(b) Saddle point for any C. 


(a) 1 
(b) /83/6 


U se the second derivative test; (0, 0) is a local maximum; 
(—1, 0) is a saddle point; (2, 0) is a local minumum. 


13. Saddle points at (n, 0), n = integer. 

15. Maximum ~ 2.618, minimum ~ 0.382. 

17. Maximum 1, minimum cos 1. 

19. z = 1/4 

2L (0,0, +1) 

23. If b > 2, the minimum distance is 2,/b — T; if b < 2, 
the minimum distance is |b]. 

25. Local maximum at (2, 0), local minimum at (0,1) and 
(3, 1), saddles at (0,0), (3,0) and (2, 1). 

27. Not stable. 

29. f(—3, -V/3/2) = 3V3/4 


3L 
33. 


x = (20/3)%⁄3; y = 1073; z = 5⁄3 


The determinant required in the general implicit 
function theorem is not zero, and so we can solve for u 
and v; (du/dx)(2, —1) = 13/32. 


35. A new orthonormal basis may be found with respect to 
which the quadratic form given by the matrix 


rn a b 

E f i 

takes diagonal form. This change of basis defines new 
variables € and ņ, which are linear functions of x and y. 
M anipulations of linear algebra and the chain rule show 
that L v = à(32v/ð£?) + u(3?v/3n?). The numbers à 
and yz are the eigenvalues of A and are positive, because 
the quadratic form is positive-definite. Ata maximum, 
dv/dE = dv/dn = 0. Moreover, 3?v/ð£? < 0 and a2u/ 
Ən? < 0, because if either were greater than 0, the cross 
section of the graph in that direction would have a 
minimum. Then Lv < 0, thus contradicting strict 
subharmonicity. 


y 


» Reverse the inequalities in Exercises 35 and 36. 


39. Let Fy = 2xu3v — yv — 1, Fo = y3v+ x°u2 — 2. 
Compute (3F1/əu)(1, 1, 1, 1) =6, (dF1/dv) 
(1,1,1,1) = 1, (@F2/au)(1, 1, 1, 1) = 2, (aF2/dv) 
(1,1,1,1) = 1. Then A = 4 £0, so the implicit 
function theorem says we can solve for u and v as 
functions of x and y near (1, 1, 1, 1). 


vane [y 3 


4L The equations for a critical point, ds/am = əs /ðb = 0, 
when solved for m and b give m = (y1 — y2)/(X1 — X2) 
and b = ( y2X1 — y1X2)/(Xı — X2). The line y = mx +b 
then goes through (x1, y1) and (x2, y2). 


43. Ata minimum of s, we have 
0 = as/ab = —2 $; (yi — mxi — b). 


4. y=2x+8 


a 


. Leta = (ax — 4a°t). Compute ut = 8a? tanh œ and 

ux = —2au tanh œ so ut + 4a2ux = 0. Then compute 
Uxx = 4a2u — u2/2. Then Uxxx = 4a2uy — UUx, SO We 
obtain ut + Uxxx + UUx = Ut + 4a2uy = 0. 


49. T'+kcT =0, 0” +020 =0, r?R”+rR'—c3R =0 
for some constants c1, C2, and c3. 


Chapter 4 


Section 4.1 


L r(t) = -—(sint)i + 2(cos 2t)j, r’(0) = 2j, 
at) = —(cost)i— 4(sin 2t)j, a0) = —i, 
Kt) =i+2tj 
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3. r(t) =[v2i+ &j-— etk, r(0) = v2i+j- k, 
at) = etj + etk, a0) =j + k, I(t) = 
v2ti+(1+t)j+(1-t)k 

5, (et — e~t, cost — sint, —3t?) 

7. [—3t2(2sint + cost) — t?(2 cost — sint) ]i + 
[3t2(2et + et) + t3(2et — et) ]j 
+[e'(cost — sint) — e™t(— sint + cost)]k 

9. Compute v= (—a sint, a cost, b) so 
a= (—a cost, —a sint, 0). Since the z-component of a 
is identically zero, ais always parallel to the xy plane. 


IL The paths in (a) and (c) are regular, while the path in (b) 
is not. 


(0, —12, —1) and (0, —26, —8) 
m(0, 6, 0) 


—~24n?(cos(2mt/5), sin (27xt/5))/25 


d 2 d dv 
g MO = gV ave 2v-a=0 


6129 seconds 


RB BRB & EE RD 


(a) dt) = (t, €), co < t < oo. The image of this 
path is the graph y = e*. 

(b) qt) = (4 cost, sint), 0 < t < 2x, an ellipse. 

(c) dt) = (at, bt, ct) 

(d) dt) = ($ cos t, 4 sint), 0 <t < 2x, an ellipse. 


27. ct) x c(t) is normal to the plane of the orbit at time t. 
Asin Exercise 26, its derivative is 0, and so the orbital 
plane is constant. 


Section 4.2 
L 2/52 
3. 2(2/2 — 1) 
5. 2 + ; log aay 
7. 2/2 
9. (a) dt) =(1-t, 2 -t, -t), t e [0,1] 
(b) V3 (c) V3 
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IL 27(/5 + V2) 
13. 3+ log2 


15. (a) Because a is strictly increasing, it maps [a, b] 
one-to-one onto [a(a), a(b)]. By definition, vis the 
image of cif and only if there is at in [a, b] with 
dt) = v. There is one points in [a(a), a(b)] with 
s = a(t), so dis) = dt) = v. Therefore, the image 
of cis contained in that of d. Use a~! similarly for 
the opposite inclusion. 


(b) a(b) s=a(b) 
la = 1 |d(s) || ds al JA (cx(t)) læ’ (t) dt 
q S ) 


(a) =a(a 


t=b b 
=a || (c(t) a(t) | t= f Ic(t)ldt =le 
a 


t=a 


(c) Differentiate dusing the chain rule. 


17. (a) le= f? lle(s)iids = f? ds =b—a 

(b) T(s) = ¢(s)/\|¢(s)|| = ¢(s), so T’(s) = e(s). 
Then k = |T] = |¢(s) |). 
Show that if vand ware in R3, 
Iv x wi = |lw— (v- w/||vi2) vil- vi]. Use this to 
show that if o(t) = (x(t), y(t), z(t)) is never 


(c 


(0, 0, 0) and ft) = o(t)/llø(t) I], then 
df 1 T, otot) 
= t t 
dt = mi e È To O 
df |ip(t) x p'(t)I 
a ae FOIE 


With p(t) = c(t), this gives 


c(t)  c(t)-c'(t) 


T) = Ten ~ eto 6 
vey) — Het) x eW 
a Tee 


If s isthe arc length of ¢ ds/dt = ||¢(t)||, and 
therefore 
dT) \jdT ds 
lall-llae a 


| = kre, 


Thus, 


= 1 dT _ jelt) xet) 
edt o ee 


(This result is useful in Exercise 21.) 
(d) 1//2 


19. (a) 
(b) 


2L (a) 


(b 


(c) 
23. (a) 


(b) 


C(t) || = ||\(—sint, cost) = 1 
k = |/T’(t)|| = ||(—cost, —sint)| = 1 
Because cis parametrized by arc length, 


T(s) = c (s), and N(s) = e”(s) /||e’(s) ||. Use 
Exercise 17 to show that 


dB (olf Cc” € Š c" 
H fd +e - c 
ds ( <r) ae Ie ) 


and 


dB (¢xe”)-c’ (cxe) e” 


ds lee = er 


T = 


Obtain T’(t) and ||T’(t)|| as in Exercise 17. Bisa 
unit vector in the direction of € x T’ = (€ x ce’) / 
|C ||, so B= (€ x c”) /||€ x c|. Use the solution 
of Exercise 17 with p = € x c’ to obtain 


dB/dt = (€ x &”)/|I¢ x "| 
{ie x €’) (ex e)I/|le x CIPE x e’), 


and the values of T’ and ||T’|| to get 
N= (IC |//ll¢ x e'e — (€ x e&’)/|e |). 


Finally, use the chain rule and the inner product of 
these to obtain 


dB 
r = — |= (s(t))] -Ns 


1 dB. (¢xc’)-c” 
|ds/dt| dt TEZJA 


V2/2 


Nis defined as T’/||T’||, so T = |T’||N= kN. 
Because T- T’ = 0, T, N, and Bare an orthonormal 
basis for R3. Differentiating B(s) - B(s) = 1 and 
B(s) - T(s) = 0 shows that B’- B = 0 and 

B' -T+ B.T =0. But T -B= ||T’||N-B=0,s0 
B’- T = 0 also. Thus, B’ = (B’- T)T + (B' - N)N + 
(B' . B)B = (B' . N)N = —tN. Also, N . N= 0, 
because N- N= 1. Thus, N = (N - T)T + (N - B)B. 
But differentiating N- T = 0 and N- B = 0 gives 
N.T = -N.T = —k and N -B = N. B' =r, 
and so the middle equation follows. 

@=tT+kB 


25. Follow the hint in the text. 
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Section 4.3 IL The flow lines are concentric circles: 
L y 
y 
Se 
X 
3. y 13. The flow lines fort > 0: 


N y 
™ "a ; 
we ~G , 
5. F= (2y, x): 
y 


>> 15. c(t) = (2e, 1/t, —1/t?) = F(dt)) 


D a 


i 17. c(t) = (cost, — sint, et) = F(dt)) 


a.“ 1 et et 
x 19. (Foot = (p trat e) = et 
2L (a) f(x,y,z) = xyz 
x2 yp zr 
(b) (KYWI= Sty ts 


23. Compare mv? for the escape velocity ve = /2gRo 
7. and the velocity in an orbit of radius Ro given in Section 
4.1. (Ignore the rotation of the earth.) 


25. Use the fact that —VT is perpendicular to the surface 
T =constant. 


27. x'(t) = x(t)eY"), y/(t) = (y(t))2(z(t))2, 
z'(t) = x(t) y(t)z(t) 


Section 4.4 
L ye% — xe” + yey? 
3. 3 


9. (a) corresponds to (i). 5. div V > 0 in the first and third quadrants, 
(b) corresponds to (ii). div V < 0 in the second and fourth quadrants. 
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7. V -F = 0; if F represents a fluid, there is neither 
expansion nor compression; the area of a small rectangle 
remains the same. 

X 

9. 3x? — x? cos (xy) 

IL ycos(xy) + xĉ?sin(x?y) 
B. 0 
15. (10y — 8z)i + (6z — 10x)j + (8x — 6y)k 
17. — sinx 
19. x 
2L (a) V-(V x F) = V. (0, —z, 2xy) = 0 
(b) No, since V x F # Q 
23. (a) ze*? + x cos(xy) + 2x>y3z 
(b) (3x y2z2, xeX2 — 5x4y3z2, y cos(xy)) 
25. (a) Nonsense. (b) Nonsense. 
(c) Nonsense. (d) Vector field. 
(e) Nonsense. (d) Scalar function. 
27. V-F= af ao aM _ o since f, g, and h do not 
əx ay | az 
depend on x, y, and z, respectively. 
29, Vx Vf =0 
3L vxvf=0 
3. VxF#0 
35. Let F = Fii+ F2j + F3kand compute both sides of the 


y 


. (a) 2xyit xj 


identity. 

(c) (—y32x3, 2x2y4z, 2x3z2 — 2xy) 
(b) (3y2xz, 4xz — y3z,0) (d) 4x2yz?2 + x? 

No. 


Separate each expression into its real and imaginary 
parts and then treat the resulting quantity as a vector 
field on R2. Directly calculate its curl and divergence. 
In (a), F = (x? — y?)i — 2xyj; in (b), 


F = (x? — 3xy?)i + (y? — 3x2y)j; and in (c), 
F = (e* cos y)i— (e* sin y)j. Show that V - F = 0 and 
V x F = 0 in each case. 


Review Exercises for Chapter 4 


L 


3. 


5. 


7. 


V1) = (3, —e7}, —x/2);a(1) = (6, e71, 0); 
a 
s(1) = 4/9 + e7? + iN) 
= (2, e71, 0) + (t — 1)(3, —e7}, —/2) 


v0) = (1, 1, 0); a(0) = (1, 0, —1); 
s = J2;\(t) = (1,0, 1) +t(1, 1, 0) 


Tangent vector: v= —(1//2)i+ (1//2)j +k 
Acceleration vector: a= —(1/./2)(i-+ J). 
m(2, 0, —1) 


9. (a) v= (—sint, cost, /3), a= (— cost, —sint, 0) 


(b) Kt) = (1, 0, 0) + t(0, 1, V3) 
(c) 4x 
4Ayz 
VX? +y? +22 
4AxZ 
VZ Ey $22 


4Axy 
= (0, 0, 0) 
Vx? +y? =) 


4Ayz 
VZ Ey 422 


VFS ( 


4Axz 
Vx? +y? +z? 

4Axy 
VX? +y? +z? 


4 
f V 1+ 4723 + £t- dt 
(a) v= (—2t sin(t2), 2t cos(t2), 0);s = 2t 


1 V3 
(0) G -F.o) 


(c) v5x/3 
(d) v= 2/57/3(/3/2, 1/2, 0); s = 2/57 /3 


(e) (1+2) [ver 


TETEN 

19. Compute c(t) and check that it equals F(¢{t)). 

2L 9; 0 

23. 3;—i-j-k 

2. 0;—i-j-k 

27. Vf = ( ye% — y sin xy, xe% — x sin xy, 0); verify that 


V x Vf = 0 in this case. 


29. Vf =(2xe*’ + y? sin xy?, 2xy sinxy2, 0); check that 
V x Vf = 0 from this. 
BL (a) (yz?, xz?, 2xyz); 
(b) (z —y, 0, =x) 
(c) (2xyz3 — 3xy2z2, 2x2y2z — y?z3, y2z3 — 2x?yz?) 
33. div F = 0; curl F = 
(0, 0, 2(x2 + y?) f'(x? + y2) +2 F(x? + y2)) 
35. (a) A cone about i’ making an angle of 2/3 with ï. 
(b) Vg = (3x2, 5z, 5y + 22) 
37. (a) [3P /ax)? + (aP /ðy)?]1/? 
(b) A small packet of air would obey F = ma. 


(c) N ree 
Wind direction 
e/ 
E 
(d) 
S Wind direction 
G 
Ww 


R2 2 
39. (a) VO zo -z 
2(R* + p*)Zo 
b eae, 
n gp? 


41. 680 miles per hour 


Chapter 5 


Section 5.1 


L (a) 1 
(b) 2 
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(c) In128 + In v2 
(d) 5 In2 = In v2 


(b) x +5 


1 
(d) log2 — 5 


13 
3. (a) i 
(c) 1 


5. To show that the volumes of the two cylinders are equal, 
show that their area functions are equal. 


7. 2r3(tan 6) /3 
26 
Xg 


IL (2/7x)(e? +1) 


Section 5.2 
7 
L (a) D (b) e—2 


(c) sind (d) 2In4—2 
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9. Use Fubini’s theorem to write 


d b 
[teow dx dy = f s| f f(x) dx| dy, 
R Č a 


and notice that J f (x) dx is a constant and so may be 
pulled out. 


1L 11/6 


13. By Exercise 2(a), we have: 


E fet E 1 1 
Fimn) = ffx y éxdy= (= 2) (a) 


Then, asm, n — oo, we see that lim f (m, n) = 0. 


15. Because fy dy =f 2y dy = 1, we have 
Ie [fp f(x, y)dy] dx = 1. In any partition of 
R =[0, 1] x [0, 1], each rectangle R jk contains points 
T with x rational and d} with x irrational. If in the 


caii partition of order n, we choose qj, = Sk in 


those oes withO<y<5 1 and Gk = dy when 
y> F, the approximating sums are the same is those for 


(x r 
g(x, y) = 2 


Because g is integrable, the approximating sums must 
converge to J, gdA = 7/8. However, if we had picked 
all qj = ce i ‘al approximating sums would have the 
value 1. 


O<y<}3 
aysh 


17. Fubini's theorem does not apply because the integrand is 
not continuous nor bounded at (0, 0). 


Section 5.3 
L (a) (iii) (b) (iv) 
(c) (ii) (d) (i) 


3. (a) 1/3, both. 
(b) 5/2, both. 
(c) (e? — 1)/4, both. 
(d) 1/35, both. 


5 A= fi, SATa dy dx =2 f1, VTT ZX dx = 
r?[arcsin 1 — arcsin(—1)] = zr? 


7. 28,000 ft3 


9. 0 


IL y-simple; 7/2. 
7 

3 

15. 507 

17. 1/24 


19. Compute the integral with respect to y first. Split that 
into integrals over [—#(x), 0] and [0, ¢(x)] and change 
variables in the first integral, or use symmetry. 


2L Let {Rij} bea partition of a rectangle R containing D 
and let f be 1 on D. Thus, f*is1on D and0 on R\D. 
Let cjk € R\D if Rij is not wholly contained in D. The 
approximating Riemann sum is the sum of the areas of 
those rectangles of the partition that are contained in D. 


Section 5.4 
L (a) fo f dydx 
(b) Mo Sp dx dy 
(c) fig {oe dy ax 
(d) fo Jeon? ax dy 
3. (a) 1/8 (b) 2/4 (c) 17/12 
(d) G(b) — G (a), where dG /dy = F (y, y) — F (a, y) 
and oF /əx = f(x, y). 
1 
5 3(¢- 1) 
7. Note that the maximum value of f on D ise and the 


minimum value of f on D is 1/e. Use the ideas in the 
proof of Theorem 4 to show that 


E az ff fe y) dA <e. 


9. The smallest value of f(x, y) =1/(x? + y? + 1) on D 
is g at (1, 2), and so 


I f(x, y) dx dy > 2. areaD =, 
D 6 


The largest value is 1, at (0, 0), and so 


I f(x, y) dx dy <1-areaD = 6. 
D 


IL Sabe 


13. 1(20,/10 — 52) /3 
15. /3/4 


17. D looks like a slice of pie. 


1 x V2 V2-x2 
T pi F(x, y)dy] ax f p toya dx 
0 0 il 0 


19, Use the chain rule and the fundamental theorem of 
calculus. 


Section 5.5 

L (a) (ii) (b) (i) ~~ (ce) (iii) (d) (iv) 
3. 1/3 

5. 10 


7. x? +y? <72< /x24 y?2, 
-Vl-y2<x<v/l-y?,-l<y<l 


9. 0<z< /1-x?~-y?, 
-VJl-y<x<vV/l-y4,-l<y<l 


IL 5072/6 
B. 1/2 

15. 0 

I7. a°/20 
19. 0 

2L 3/10 
23. 1/6 


a ff ae (x, y, Z) dz dy dx 
Va 


4 Jay? 
27. [ Vee i, f(x, y, z) dz dy dx 
f(x,y) 
2. || | az ax dy= f] tuyida 
pd Jo D 


3L Let M. and me be the maximum and minimum of f on 
Be. Then we have the inequality me vol (Be) < Sffa, 
f dV <M, vol (Be). Divide by vol (Be), let e + 0 and 
use continuity of f. 
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Review Exercises for Chapter 5 


L 81/2 
TOE 
3. 7E -e+7 
5. 81/2 
is ..8 
7. 7E -e+7 
9. 7/60 
1L 1/2 


I3. In the notation of Figure 5.3.1, 


b 
If axdy = f [b2(x) — b1(x)] dx. 
D a 


15. (a) 0 (b) 1/24 


17. y-simple; 27 + 22. 


19. x-simple; 73/3. 


2L y-simple; 33/140. 


y 
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23. y-simple; 71/420. 


The integral in the order dy dx dz, for example, is 


1 pl pl 
| f f f(x, y, z) dy dx dz. 
0 Jz J1-x 


Chapter 6 
Section 6.1 
x L (a) One-to-one, Onto. 
(b) Neither. 
(c) One-to-one, Onto. 
25. 1/3 (d) Neither. 
27. 19/3 3. An appropriate linear function T is given by 
T (x, y) = (x, —% + 44), or in matrix form, as: 
29. 7/12 
-33 
i 5. S = the unit disc minus its center. 
N 7. D = (0, 3] x [0, 1]; yes. 
‘ 9. The image is the triangle with vertices (0, 0), (0, 1), and 


1 
3L The function f(x, y) = x? + y? + 1 lies between 1 and 
2? + 1=50nD, and so the integral lies between these 1L 
values times 4x, the area of D. 


33. Interchange the order of integration (the reader should 
draw a sketch in the (u, t) plane): 


X t X X 
f | tora f f Fda 
0 Jo 0 Ju 
X 
= f (x= uF (u) du. 
0 


37. The region is the shaded region W in the figure. 15. 
Z 


35. 7/12 


(1, 1). T is not one-to-one, but becomes so if we 
eliminate the portion x* = 0. 


D is the set of (x, y, z) with x? + y2 + z? < 1 (the 
unit ball). T is not one-to-one, but is one-to-one on 
(0, 1] x (0, z) x (0, 27]. 


Showing that T is onto is equivalent in the 2 x 2 case to 
showing that the system ax + by = e, cx + dy = f can 
always be solved for x and y, where 


kal 

A= ; 

c d 

When you do this by elimination or by Cramer's rule, the 


quantity by which you must divide is det A. Thus, if det 
A + 0, the equations can always be solved. 


Suppose that T (x) = T (y). Then 


Ax+v= Ay+v 
Ax= Ay. 


By Exercise 12, this implies that x = y if and only if 
detA + 0. 

Showing that T (x) = Ax-+ vis equivalent to showing 
that 


T(x) =Ax+v=y 
or 


Ax=y-VvV 


has a solution for any choice of y € R2. This happens if 
and only if detA + 0, by Exercise 13. Finally, verifying 
that T takes parallelograms to parallelograms follows 
exactly as in Exericse 14, by simply applying T to both 
sides of the given equation and simplifying. 


17. We can show that T is not globally one-to-one by 
example. A simple choice is to compare the point (1, 0) 
with (—1, 0), which correspond to the polar coordinates 
r=1,6=Oandr = 1, 6 = x, respectively. We note: 


T(1cos0, 1sin0) = (12 cos 0, 12 sin 0) 
= (1? cos 2x, 12 sin 2x) —T(1cosz, 1sinz). 


Since T (1, 0) = T (—1, 0), T is not one-to-one. 


Section 6.2 


L A good substitution might beu = 3x + 2y, v =x — y, 


: .__ O(X, y) o1 
which has J acobian Wa S 


3. zey 


5. D istheregion 0 < x <4, Sx+3< y< 5x+6. 
(a) 140 (b) —42 


N 


* istheregion0 <u <1,0<v<2; 


D 
9-2/2- 3V3). 


x 


An[/3/2 — log (1 + V3) + log v2] 
4x log(a/b) 
0 


2n[(b2 + 1)e7?° — (a2 + 1)e-27] 
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3L 24 
33. (a) rabe (b) * zabe 
3 5 
35. (a) Check that if T (uz, v1) = T (u2, v2), then u1 = u2 
and v1 = v2. 
(b) 160/3 


37. S [f ((au2)¥/3, (av?) t3)u 7130713] du dv 
D* 


Section 6.3 


1 
L (1 5a) 


3. [x2 —sin(x2)]/x3 


11 65 
5 (e s) 
7. $503.64 


9. (a) 5, where 6d is the (constant) mass density. 
(b) 37/12 


IL 5007 (10 — 5) 


1 11 
aa (533) 
15. 1/4 


17. Letting 5 be density, the moment of inertia is 


k pdx paseco 
f I f (otsin? ¢) dodo dẹ. 
0 Jo 0 


19. (1.00 x 108)m 


ZL (a) Theonly plane of symmetry for the body of an 
automobile is the one dividing the left and right 
sides of the car. 


Z- [fy 6(% y, 2) dx dy dz is the z coordinate 

of the center of mass times the mass of W . 
Rearrangement of the formula for z gives the first 
line of the equation. The next step is justified by the 
additivity property of integrals. By symmetry, we 
can replace z by —z and integrate in the region 
above the xy plane. Finally, we can factor the minus 
sign outside the second integral, and because 

5(X, y, Z) = ô(u, v, —w ), we are subtracting the 
second integral from itself. Thus, the answer is 0. 


(b 
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(c) In part (b), we showed that Z times the mass of W is 
0. Because the mass must be positive, Z must be 0. 


(d) By part (c), the center of mass must lie in both 
planes. 


23 V = —(4.71 x 10!9)G m/R ~ —(3.04 x 109)m/R, 
where m is the mass of a test particle at distance R from 
the planet’s center. 


25. Inthe x, y-plane, the circle D given by 
(x — a)? + y? =r? has center (and center of mass) 


(a, 0). Also, the area of the circle has area A(D) = ar. 
Therefore, by Exercise 24 we have: 


Vol(W) = 2x(a)(ar’). 
Section 6.4 
14 
3. 3/16 
5 ata 


7. (a) 3z 
(b) A<1 

9. Integration of {{e-*Y dx dy with respect to x first and 
then y gives log 2. Reversing the order gives the integral 
on the left side of the equality stated in the exercise. 


IL Integrate over [e, 1] x [e, 1] and let e + 0 to show the 
improper integral exists and equals 2 log 2. 


B T1433)? ay 


15. Use the fact that 
sin? (x — y) 1 


<= é 
Jl-x2-y2 ~ /1—x2-y2 


17. Use the fact that eX’+Y" /(x — y) > 1/(x — y) on the 
given region. 


19. Each integral equals 1/4, and Theorem 3 (Fubini's 
theorem) does apply. 


2L Here, welet Dı =[0, 1] x [0, 1], and D2 = [1, co] x 
[1, co], as in the hint. On D1, let g(x, y) = aye and 
f(x,y) = may Since x, y = 0, itis clear that 
0 < f(x, y) < g(x, y) for all points in D1. Therefore, 
since Sho, g(x, y) dx dy exists by Exercise 5, we know 
that Fp: f(x, y) dxdy must also exist. 


You may use a similar argument for the region D2 by 
choosing a different g(x, y) and applying the result of 
Exercise 6. Once ff f(x, y) dx dy exists over both the 
regions Dı and D2, it will exist also over their union 
D= Diü Dyz 


Review Exercises for Chapter 6 


La T= Da aG) 
(b) ffa F(x, y) dx dy = 4 ff, f(2u + v, 2v) du dv 


3. 3 (Usethe change of variables u = x? — y?, v = xy.) 
1 

5 z742- t 

7. (5x/2)V15 

9. abc/6 


IL Cut with the planes x + y +z = </k/n, 
1<k <n-—1,k an integer. 


13. (25 + 10/5)2/3 


15. (e — e~!)/4 (Use the change of variables 
u=y—-X,v=y+x.) 


17. (9.92 x 10°)m grams 


19, (a) 32 


(b) This occurs at the point of the unit sphere 
x? + y? + z? = 1 inscribed in the cube. 


(0, 0, 3a4/8) 
4x In(a/b) 
m/2 


(a) 9/2 (b) 64x 


B NARE 


Work the integral with respect to y first on the region 
Det ={(x, y)le <x <L,0 < y < x} to obtain 

let =ffp,, f dx dy = fy x-3(1—e-*) dx. The 
integrand is positive, and so |,_ increases as e —> 0 and 
L — oo. Bound 1—e™ above by x for0 < x < land 
by 1 for1 <x < oo to see that I... remains bounded 
and so must converge. The improper integral does exist. 


3L (a) 1/6 (b) 167/3 


33. 2x 


Chapter 7 
Section 7.1 
(3coszt, 3sinzt), t € [0, 1] 
=a -f (6—9,0), t e[1 2] 


(t,sinzt), te [0,1] 
3. y(t) = { (2x — xt, 0), te [1,2] 


5. y(t) =(3cos2zt, 4sin2zt, 3), te€[0,1] 
7. y(t) = (t,t, t3), t © [—3, 2], or 
y(t) = (5t — 3, 5t — 3, (5t —3)3), te [0,1] 


9, [fovsads= f tity zaa 
c | 


1 
=) 0-1dt=0 
0 


IL (a) 2 (b) 52/14 


1B —5(1 + 1/22)? + 3129) 


15. (a) The path follows the straight line from (0, 0) to 
(1, 1) and back to (0, 0) in the xy plane. Over the 
path, the graph of f is a straight line from (0, 0, 0) 
to (1, 1, 1). The integral is the area of the resulting 
triangle covered twice and equals /2. 


J/2(1 — t*) when —1<t <0 
(b) st) = { 
J2(1 +t!) when O<t<1. 
The path is 
doe (1—s//2)(1,1) when O<s<v2 
~ L(s/(/2—1))(1,1) when V2 <s <2V2 


and fo fds = V2. 
17. 2a/zx 


19. (a) [2/5 + log(2 + /5)]/4 
(b) (5/5 — 1) /[6./5 + 3log(2 + /5)] 


2L Since the graph g is parameterized by y(t) = (t, g(t)), 
we have y(t) = (1, g’(t)), and thus: 


y(t) | = V1 + (g'(t))2. 
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a (a) j 
g 
(b) Solving for y, we have: 
y =—-V 2x —x2 +1. 


(N ote that the negative square root was chosen for 
y.) Therefore our formula becomes: 


1 
1 
dx. 
f —29(/2x — x2 + 1) 
Section 7.2 
L -1 


3. (a) 3/2 (b) 0 (c) 0 (d) 147 


5. 9 


7. By the Cauchy- Schwarz inequality, 
|F(qt)) -e(t)| < ||F(q{t)) || c(t) |] for every t. Thus, 


[F-44- 


b 
<f IF(dt)) -c (t)i dt 
a 


b 
7 F(dt))- c(t) dt 
a 


b 
<f I|F(q{t)) || e(t) dt 
a 


b 
<M | ye(oua—=mi. 
a 


a -m-n 

IL 0 

I3. The length of c 

15. If c(t) is never 0, then the unit vector T(t) = 


c(t) /||¢(t) || is a continuous function of t and so is a 
smoothly turning tangent to the curve. The answer is no. 


17. 7 
19. Use the fact that F is a gradient to show that the work 
done is — — —, independent of the path. 
Ro Ri 
2L (a) |!¢(x)|I 


(b) f has a positive derivative; it is one-to-one and onto 
[0, L] by the mean-value and intermediate-value 
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theorems. It has a differentiable inverse by the 
inverse function theorem. 


(c) g/(s) =1/||¢(x)|], wheres = f(x). 


(d) By the chain rule, (s) = €(x) - g/(s), which has 
unit length by part (c). 


Section 7.3 


Lz=2(y-1)41 


3. 18(z — 1) — 4(y +2) — (x — 13) =0 
or 18z — 4y -x -13 = 0. 


5. Not regular when u = 0. 


7. (a) (iii) (b) (i) (c) (ii) (d) (iv) 
9. The vector n= (cos v sinu, sin vsinu, cosu) = (x, y, 2). 
The surface is the unit sphere centered at the origin. 


IL n= —(sinv)i— (cos v)k; the surface is a cylinder. 


I3. (a) x = xo + (y — yo)(3h/3y)( Yo, Zo) + (z — 20) 
(ah /3z)( yo, zo) describes the plane tangent to 
x = h(y, z) at (Xo, Yo, Zo), Xo = h( yo, Zo). 
(b) y = yo + (x — Xo0)(dkK/AX)(Xo, Zo) + (z — zo) 
(ək/əz)(x0, Zo) 


15. z— 6x — 8y +3=0 


17. (a) The surface is a helicoid. It looks like a spiral ramp 
winding around the z axis. (See Figure 7.4.2.) It 
winds twice around, since 6 goes up to 4r. 

(b) n= +(1//1+ r2)(sin@, —cosé, r) 

(c) yox — xoy + (x + yê)z = (x4 + yê) zo. 

(d) If (Xo, Yo, Zo) = (ro, COS 80, ro SiN, 80), then 
representing the line segment in the form 
{(r COS 4, r sin 8o, @9)|0 <r < 1} shows that the 
line lies in the surface. Representing the line as 
{(xo, tyo, Z0)|0 < t < 1/(xé + y§)} and substituting 
into the results of part (c) shows that it lies in the 
tangent plane at (Xo, Yo, Zo). 


19. (a) Using cylindrical coordinates leads to the 
parametrization 


(2,6) = (\/25 +22 cos, \/25 + 22sin8, z), 
-œ <Z2<00,0<0<2n 
as one possible solution. 


(b) n= (v25 +2? cos6, v25 +22 sind, —2) //25 + 222 


(c) xox + yoy = 25 


(d) Substitute the coordinates along these lines into the 
defining equation of the surface and the result of 
part (c). 


2L (a) u > u,v v, u UY, and v > v? all map R 
onto R. 


(b) Ty x T, = (0, 0, 1) for &1, and this is never O For 
the surface ©), Ty x T, = 9u2v2(0, 0, 1), and this 
is Oalong the u and v axes. 


We want to show that any two parametrizations of a 
surface that are smooth near a point will give the 
same tangent plane there. Thus, suppose 

©: D c R? > R? and Y:B c R? > R? are 
parametrized surfaces such that 


(c 


®(uo, vo) = (Xo, Yo, Zo) = W(So, to) (i) 


and 
(T? x Ty lee 7 o 


and (TY xTY)| 4) ZO ii) 


so that ® and W are smooth and one-to-one in 
neighborhoods of (uo, vo) and (So, to), which we 
may as well assume are D and B. Suppose further 
that they “describe the same surface,” that is, 
(D) = (B). To see that they give the same 
tangent plane at (Xo, yo, Zo), show that they have 
parallel normal vectors. To do this, show that there 
isan open setC with (uo, vo) €C c D anda 
differentiable map f:C — B such that ®(u, v) = 
W(f(u, v)) for (u, v) e C . Once you have done 
this, computation shows that the normal vectors are 
related by T? x T? = [a(s, t)/a(u, v) ITY x TY. 
To see that there is such an f, notice that since 
TY x T 4 O at least one of the 2 x 2 
determinants in the cross product is not zero. 
Assume, for example, that 


əx ay 
as as 
dx ay 
ət ət 
Now use the inverse function theorem to write (s, t) 


as a differentiable function of (x, y) in some 
neighborhood of (Xo, yo). 


(d) No. 


40. 


23. (a) Weplug the parametrization into the left hand side 
of the equation, and simplify: 


(\/x2 + y2— R)? 4-2? 


= (4/((R +r cosu) cosv)2 + ((R +r cosu) sin v)? 


—R)2 + (r sinu)? 


=(./(R +r cosu)? — R)? +r? sin? u 
= (R +r cosu — R)? +r?sin? u 
= (r cosu)? +r? sin? u 


=r?. 


(b) We calculate the associated normal element 


Ty x T, = (—r cosu cosv(R +r cosu), 
—r cosu sinv(R +r cosu), 
—r sinu(R +r cosu)) 


and find that it is not equal to the zero vector for any 
choice of (u, v). 


Section 7.4 
L 4r 
3. 3 wlV2 log(1+ v2) 


5. (a) (e sin v, —e" cos v, e) 


(b) x+z= > 
(c) 2/2(e — 1) 


o. Sa(6vő— 8) 


IL The integral for the volume converges, whereas that for 
the area diverges. 


2m prn 
ate) = f 1 
0 0 


v/a2b2 sin? ¢ cos? + b2c? sint o cos? 0 + a2c2 sin’ ¢ sin? 0 do do 


15. (7/6)(5v5 — 1) 
I7. (2/2) V6 


19. 4/5; for fixed 9, (x, y, z) moves along the horizontal 
line segment y = 2x, z = 6 from the z axis out to a 
radius of /5| cos 6| into quadrant 1 if cos@ > 0 and into 
quadrant 3 if cos@ < 0. 


2L (x +2)/(x —2) 
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23. x(a +b)vV1+m?(b-— a) 


25. Š (9V3 -8/24 1) 


27. With f(x, y) = y R2 — x2 — y2, (4) becomes 


| x+y? 
Muc 
asi = f) ese Ry a 


= || =o 


where D is the disc of radius R. Evaluate using polar 
coordinates, noting it is improper at the boundary, to get 
Qn R2. 


Section 7.5 


Ls 


3. 11/14 


5. (a) For this surface parameterized by ®, we have: 


x? — y? = (u + v)? — (u — v)? 


= (u2 + 2uv + v?) — (u? — 2uv + v?) 
= 4uv 
=:47; 
(b) 0 
3V2 +5 
A 
9. zra? 
IL (a) VūrR? (b) 20 RZ 
m (5/5 1 
B(G +55) 
15. 167R3/3 


17. (a) The sphere looks the same from all three axes, so 
these three integrals should be the same quantity 
with different labels on the axes. 


(b) 4x R4/3 
(c) 42R4/3 


19. 8 
2L (R/2, R/2, R/2) 
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23. (a) Directly compute the vector cross product Ty x T, Section 7.7 
and then calculate its length and compare your 
answer to the left-hand side. L Apply tormula (3) of ENS SeenON and simplify; H = 0 
(b) In this case, F = 0, so A(s) = ffy VEG du dv. a a 
(c) 47a? 3. Apply formula (3) of this section and simplify. 
25. Leta = ðx/ðu, b = ay/du, c = əx /ðv, and —4a5p6 
d = dy/dv. The conditions (a) and (b) in Exercise 16 5. K = (atb? + 4b4u2 + 4a4v2)2 
are then a2 + b? = c? + d? and ac + bd = 0. Show that 
a Æ 0 and, by a normalization argument, show that you 7. Using the standard parametrization of the ellipsoid 
can assume a = 1. Now calculate further. (u, v) = (a cosu sin v, a sinu sin v, c cos v), u € 
[0, 27x], v € [0, x], from Exercise 6 you should have 
27. 2a? found that the Gauss curvature of the ellipsoid is: 
atc? 
Section 7.6 = 
(a4 cos? v + a2c2 cos2 u sin? v + a2c? sin? u sin? v)2 
L z atc? 


(a4 cos? v + a2c? sin? v)? ` 
3. (a) 187 (b) 36x 

Then, the area area element for the ellipsoid is given as: 
5. +48x (the sign depends on orientation). 


Ty x Ty = sinvy/a4 cos? v + a2c? sin? v. 
7. 4x 


This yields the integral: 
9. 2x (or —2:r, if you choose a different orientation). 


on afc? sin v 
IL 27 du dv. 
(a4 cos? v + a2c2 sin? v)? 


13. 127/5 


To evaluate this integral, we try to get it into one of the 


15. With the usual spherical coordinate parametrization standard forms found in the tables contained in the text: 


Te x Tg = —singr (see Example 1). Thus, I _ afc? sin v 
(a4 cos? 


202 3 
[[Fas= ff FT xt doae oe 
S 


4,2 
on f atc sinv i 
=) (F-r)singdodo o a3(a2 cos? v + c? sin? v)? 
2m pr =2nac? | = 3 dv 
-/ | F, singdgdé 0 (a2 cos? v+ c?(1-— cos? v)? 
o Jo 


IT 
sin 
=2nac? | p z dv 
and o ((a2— c?) cos? v + c2)? 


bh n O 2mac? [ sin v dy 
fdS = f sing dọ d8. (a2 _ ¢2)3 ; 
n! j I os le (co vt 82)’ 


Atthis point, make the substitution w = cos v: 


z dudv 


17. For a cylinder of radius R = 1 and normal component Fr, 
sin v 


2 T 
b pdx a zf zdv 
[fres= ff F, dé dz. (a* —c*)2 Jo (cot v+ 38)’ 


19, 27/3 _ 2nac? ‘A 1 


2 (a? =c)? Ja aT? 
21 za bcr (w +( za) ) 


Finally, use the trigonometric substitution 

w= a tan 6 to finish the integration. The final 
solution will simplify to 4x, verifying the 

Gauss-B onnet theorem. 


9. Apply formula (3) of this section and simplify. 


1L Apply formula (2) of this section and simplify; 
K = —h’’/[(1 + (h’)?)h]. 
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L (a) 3V2(1 — eĉ") /13 
(b) —2/2/2 
(c) (236, 158/26 — 8) /35-(25)3 
(d) 8/2/189 


3. (a) yi (b =1/2 
T 
5. 2a3 
7. (a) A sphere of radius 5 centered at (2, 3, 0); 


(6, p) =(2+5cosdsing, 3+ 5sinésing, 5 
cos¢);0< 0 <27;0< <n. 
(b) An ellipsoid with center at (2, 0, 0); 


(6, $) = (2 + (1/V2)3cosé sing, 
3sindsing, 3cos¢);0 <6 <27,0<¢ <x. 


(c) An elliptic hyperboloid of one sheet; 


®(6,z) = (5 ve+ 22? cos 0, SVF sind, 2); 


0 <98 <2r, -%0 <Z<o. 


1 i 
9, A(®) == K 3 cos? 6 + 5d8; ® describes the upper 
nappe of a cone with elliptical horizontal cross sections. 


IL 113/6 
B. 2/3 
15. 5/5/6 


T7. (a) (eY cosxz, xe” coszz, =z xe sinzz) 
(b) 0 


19. 5+1) 

2L n= (1//5)(—1, 0, 2),2z-—x = 1 

23. 0 

25. If F = V¢, then V x F = O (at least if is of class C 2; 


see Theorem 1, Section 4.4). Theorem 3 of Section 7.2 
shows that f- V# - d= 0 because cis a closed curve. 
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27. (a) 247 (b) 24x (c) 60r 


29. (a) [/ R2 + p?(zo — z1)]/p 


2zo(R2 + p2)/p2g 


Chapter 8 


Section 8.1 
(2t—1,-t+1), te[0,1] 
L y(t) = (2t—1,2t—2), te[l1,2] 
(—4t + 11, -t + 4), t € [2, 3] 


3. 8 
5. 8 
7. 61 
9. —8 


IL (a) 0 
(b) —2R2 
(c) 0 
(d) —2R2 


13. 37a? 

15. 37/2 

I7. 32(b2 — a?)/2 

19. (a) Both sides are2m. (b) 0 
2L 0 

23. zab 


25. A horizontal line segment divides the region into three 


regions of which Green’s theorem applies; now use 
Exercise 16 or the technique in Figure 8.1.5. 


27. 97/8 
29. If > 0, thereisaé > 0 such that |u(q) — u(p)| < £ 


whenever ||p— ql| =  < ô. Parametrize 3B (p) by 
q9) = p+ e(cosé, sing). Then 


2x 
I (o) = 2zutpli < f \u(q{9)) — u(pIdo < 27x. 
0 
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33. 


35. 
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If p= ( p1, p2), parametrize 3B ,(p) by 

pt> (pı +e coso, p2 + p sin 8), then 

I(p) = i u( pı + pcosé, p2 + psind) dé. 
Differentiation under the integral sign gives 


dl 27 2a 
— f Vu-(cosé, sino) 60 = f Vu-nd@ 
0 0 


dp 
Hassi) V% dA 
B, on P JJe, 


=: f 
P Jas, 


(the last equality uses Exercise 30). 


Using Exercise 32, 


R 27 
1. uda = | f u[p+ p(cosé, sind) ]o d8 do 
Br 0 Jo 
R 
al (J uds Jap 
0 3B, 


R 
= 2x pu(p) dp = R2u(p). 
0 


Suppose u is subharmonic. We establish the assertions 
corresponding to Exercise 34(a) and (b). The argument 
for superharmonic functions is similar, with inequalities 
reversed. 

Suppose V2u > 0 and u(p) > u(q for all qin 
Br(p). By Exercise 31, |’(o) > Ofor0 < p < R, and 
so Exercise 32 shows that 27ru(p) < | (o) < | (R) for 
0<p<R.Ifu(q) <u(p) forsomeq= p+ 
p(COS 69, Sin 60) € Br (p), then, by continuity, there is 
an arc [8o — ô, 69 + 6] on ƏB (pP) whereu < u(p) —d 
forsomed > 0. This would mean that 


A 


27 
2xu(p) < I (p) = F u[p+ p(cos9, sin)]o do 
0 


IA 


This contradiction shows that we must have u(q) = u(p) 
for every qin Bg (p). 

If the maximum at pis absolute for D, the last 
paragraph shows that u(x) = u(p) for all xin some disc 
around p If c [0, 1) > D is a path from pto q then 
u(q{t)) = u(p) for all t in some interval [0, b). Let bo be 
the largest b € [0, 1] such that u(qt)) = u(p) for all 
t € [0, b). (Strictly speaking, this requires the notion of 
the least upper bound from a good calculus text.) 
Because u is continuous, u(qbo)) = u(p). If bo 1, 
then the last paragraph would apply at dbo) and u is 
constantly equal to u(p) on a disc around dbo). In 
particular, there is aé > 0 such that u(d{t)) = u(dbo)) = 
u(p) on [0, bo + 4). This contradicts the maximality of 
bo, so we must have bọ = 1. That is, dq) = dp). 
Because qwas an arbitrary pointin D, u is constant 
onD. 


(2x — 28)u(p) + 26[u(p) — d] < 27u(p) — 2 ôd. 


37. 


Assume V2u1 = 0 and V2u2 = 0 are two Solutions. Let 
$ = u1 — u2. Then V2¢ = 0 and $(x) = 0 for all 

x € aD. Consider the integral f f} V2 dA = 

— [fp V- Veda. Thus, ffa Vo. Vo dA = 0, which 
implies that V = O and so ¢ is a constant function and 
hence must be identically zero. 


Section 8.2 


L 


hb RB ep N wow 


5 


23. 
25. 


(3t — 1,1, —6t +4), te [0,1] 
(2,2t —1,-6t+ 4), te[1,2] 
(—3t + 8, 3, 10t — 28), t € [2, 3] 
(—1, —2t + 9, 2t — 4), t € [3,4] 


3v), u e [-1, 2], v 


y(t) = 


(u, v) = (u, v, 5 — 2u [1, 3] 
0 (Note: F is a gradient field.) 

us 
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—2n 

Each integral in Stokes’ theorem is zero. 

0 

—4n//3 

0 

+27 

Using Faraday’s law, f f [V x E + 0H/at]-dS = 0 for 
any surface S. If the integrand were a nonzero vector at 
some point, then by continuity the integral over some 
small disc centered at that point and lying perpendicular 
to that vector would be nonzero. 

The orientations of 0S; = dS must agree. 

Suppose C is a closed loop on the surface drawn so that 
it divides the surface into two pieces, Sı and S2. For the 
surface of a doughnut (torus) you must use two closed 
loops; can you see why? Then C bounds both Sı and S2, 


but with positive orientation with respect to one and 
negative with respect to the other. Therefore, 


J [vxtas= |f vxras+ ff vxr-as 
S Sı S2 
= | F-ds- | F-ds=0. 
Cc C 


27. (a) IfC =95, fe v-ds= ff (V x v)-dS= 


; O-ds=0. 


(b) fe veds= f? v- clt) dt =v- f? elt) dt = 
v- (db) — da)), where c [a,b] > Risa 
parametrization of C . (The vector integral is the 
vector whose components are the integrals of the 
component functions.) If C is closed, the last 
expression is 0. 


29. Both integrals give 7/4. 
3L (a) 0 (bbx (c)zx 
33. —20z (or 20x if the opposite orientation is used). 


35. One possible answer: The M dbius curve C is also the 
boundary of an oriented surface S; the equation in 
Faraday’s law is valid for this new surface. 


Section 8.3 
L (a) f =x?/2+y2/24+C 
(b) Fis nota gradient field. 
(c) f= pote +c 


3. (a) There exists such a G, but no such g. 
(b) There exists such a g, but no such G. 
(c) There exists such a g, but no such G. 
(d) Neither function exists. 
5. If F= Vf = Vg and C is a curve from vto w, then 


(f —g)(w) — (f — g)(v) = fe Vif — 9) -ds= 0 and 
so f — g is constant. 


7. x?yz — cosx +C 


9. Yes, itis the gradient of g(x, y) = F (x) + F (y), where 
F= f(x). 

IL No; V x F=(0,0,-x) #0 

, 13 1 

B3. esin1 + 3° -5 

15. 3.5 x 1029 ergs 

I7. (a) f(x,y,z) =x2yz 
(b) Nota gradient field. 
(c) Nota gradient field. 
(d) f(x, y,z) =x cosy 


19. Use Theorem 7 in each case. 
(a) —3/2 (b) —1 
(c) cos(e2) — cos(1/e) /e 
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2L (a) No. 


(b) (5207-2277) or 
12 lz 3 
(5 2xyz sy" X*Z z0). 


2. Ka xj + yk) 


25. (—zsiny + ysin x, xz cos y, 0) (Other answers are 
possible.) 


27. (a) V xF= (0,0,2) 40 

(b) Let dt) be the path of an object in the fluid. Then 
F(c{t)) = c(t). Lete{t) = (x(t), y(t), z(t)). Then 
x’ = =y, y’=xX,andz’ = 0, and soz is constant 
and the motion is parallel to the xy plane. Also, 
x”+x = 0, y” +y =0.Thus, x = Acost +B sint 
and y = C cost + D sint. Substituting these values 
in x’ = —y, y’ =x, wegetC = —B, D =A, so 
that x? + y? = A? + B? and we have a circle. 


(c) Counterclockwise. 


GmM 
29. (a) = f 
F CENETA Ye 2 


x? 4 y? +22 — 3x2 


V.F = -—GmM 
i (x? + y? + z?)5/2 


x? yz? —3y? Peyer- 
(x2 +y? +z2)5/2 (xe + y? + z?)5/2 
=0 
Let S be the unit sphere, S; the upper hemisphere, 


S2 the lower hemisphere, and C the unit circle. If 
F = V x G, then 


[Fes | Fass || F-dS 
S Sı S2 
= f &-ds- | G-ds=0. 
c f 


But [f F-dS=-GmM ff.(r/|iri?)-ndS = 
—4nG mM , because ||r|| = 1 and r = non S. Thus, 
F = V x G is impossible. This does not contradict 
Theorem 8 because F is not smooth at the origin. 


(b 


Section 8.4 
L 3 

3. 4rr? 

5, 47 

7. 3 
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9. (a) 0 
(b) 4/15 
(c) —4/15 


IL 6 


17. Apply the divergence theorem to f F using 
V-(fF)=Vf-F+fv-F. 


19. if F = r/r?, then V - F =1/r2. If (0, 0, 0) g 2, the 
result follows from Gauss’ theorem. If (0, 0, 0) € Q, we 
compute the integral by deleting a small ball 

= {(X, y,z)|(x2 + y? +.22)!/2 < e} around the 
origin and then letting € — 0: 


[Tf = [flog PY = rae 
mung (ff ee 


The integral over 4B, is obtained from Theorem 10 
(Gauss’ law), becauser = s everywhere on Bg. 


2L Use the vector identity for div( f F) and the divergence 
theorem for part (a). Use the vector identity 
V-(fVg —gVf) = fV2g —gV2f for part (b). 


23. (a) If (P) = fffy o(@/(47|Ip— qll) dV (q, then 


Volp = // Lola) /471Vp(1/\Ip— ql) dV (aq) 
w 


-Jf | Ilg /4rlllp- g/1p- a? 16V (a. 


where Vp means the gradient with respect to the 
coordinates of pand the integral is the vector whose 
components are the three component integrals. If p 
varies in V U dV and nis the outward unit normal 
to aV, we can take the inner product using these 
components and collect the pieces as 


eq 1 
y .n=— dV (q. 
neeg i ar pqp P IVA 


Ena 


Thus, 


Jf voip -ndvip) =- |f 
ay av 
p(q@ 1 
II 4x Ip- p-ga” q- nda) dV (p). 


There are essentially five variables of integration 
here, three placing qin W and two placing pon 3V. 
Use Fubini’s theorem to obtain 


If. Vo-n-dS 
E 


If V is a symmetric elementary region, Theorem 10 
says that the inner integral is 4x if qe V and 0 if 
q¢V.Thus, 


[frm [I], rawa 


Because p = 0 outside W, 


a ve-ns=— ||| saavia. 


If V isnot asymmetric elementary region, 
subdivide it into a sum of such regions. The 
equation holds on each piece, and, upon adding 
them together, the boundary integrals along 
appropriately oriented interior boundaries cancel, 
leaving the desired result. 


By Theorem 9, ffy Ve-dS= fff, V2@dV, and 


so fff V2edV = — fff, edV. Because both p 


and V2 are continuous and this holds for 
arbitrarily small regions, we must have V2¢ = —p. 


(b 


If the charge Q is spread evenly over the sphere S of 
radius R centered at the origin, the density of charge per 
unit area must be Q /4nR?2. If pis a point not on S and 
q € S, then the contribution to the electric field at p due 
to charge near qis directed along the vector p— q 
Because the charge is evenly distributed, the tangential 
component of this contribution will be canceled by that 
from a symmetric point on the other side of the sphere at 
the same distance from p. (Draw the picture.) The total 
resulting field must be radial. Because S looks the same 
from any point at a distance || pj] from the origin, the 
field must depend only on radius and be of the form 
E= fi(r)r. 
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If we look at the sphere £ of radius ||], we have 7. (a) 
Form? (a V1 + V2) = Form: («A1 + A2, B1 + Bz, aC, + C2) 
co = («A1 + A2) dydz + («B1 + B2) dz dx 
(charge inside £) = ff ess- ff f (Ipli) r- nds + folk Co) dx dy 


= a(A,dydz+ By dz dx + C1 dx dy) 
+ (A2 dy dz + B2 dz dx + C2 dx dy) 
= a Formz (Wi) + Formz (V2). 


f (IPI) Ip| area £ = 4 Iip? f (IpI). 


If |ipl| < R, there is no charge inside £; if |p|] > R, the 


charge inside £ is Q, and so (b) 
1 Q À do = di dx+ 2A ay+ 2 ae A dx + A(dx)? 
= aP if Iip > R ax ay az 
E(p) = 2 47 IIPI ; i : 
; a f] ð 
0 if < R. 2 
I Pll (3 dx + ay dy + T dx) ady + B(dy) 
27. By Theorem 10, ffy F-dS= 4x for any surface ee yr | Kae 
enclosing the origin. But if F were the curl of some field, ax ay az 


then the integral over such a closed surface would have 
to be 0. But ( dx)? = ( dy)? = ( dz)? = dx ^ dx = dy a dy= 
dz ^ dz = 0, dy a dx = —dxa dy, dz ^ dy = 


29, If S = aW, then ffo r- ndS = [/'f, Joriy = — dy A dz, and dx ^ dz = — dz ^ dx. Hence, 


JS Jy 30V = 3 volume (W ). For the geometric 


explanation, assume (0, 0, 0) € W and consider the da = (= = =) dydz+ (= = <) dz dx 
skew cone with its vertex at (0, 0, 0) with base AS and dy əz əz ðy 
altitude ||r||. Its volume is =(AS)(r-n). 3B ðA 
3 + dx dy 
Ox. ay 
Section 8.5 = Eoiena ACON; 
9. An oriented 1-manifold is a curve. Its boundary is a pair 
L (a) (2xy? — yx?) dx dy of points that may be considered a 0-manifold. Therefore, 

(b) (x2? + y?) dx dy æ is a0-form or function, and Hn dw = a(b) — o(a) 


if the curve M runs from a to b. Furthermore, dæ is the 


A REY 
i ey FAA dz 1-form (3@/3x) dx + (3w/ðy) dy. Therefore, fy dæ is 


(d) (xy +x?) dx dy dz the line integral f4 (3%/3x) dw + (dw/ dy) dy = 

(e) dx dy dz i Væ- ds Thus, we obtain Theorem 3 of Section 7.2, 
3. (a) 2xy dx + (x? + 3y2) dy Jy Vo: ds= olb) — ofa). 

(b) —(x + y?sinx) dx dy IL Puto = Fy dx dy + Fody dz + F3 dz dx. The integral 

(c) —(2x + y) dx dy becomes 

(d) dx dydz 


(e) 2x dx dydz I w= fff dw 
aT 


(f) 2ydydz— 2x dz dx 


4xy =J] (i afa s) ayaz 
(g) ~ (x2 4 y2)2 dx dy 


(h) 2xy dx dydz 


(a) 0 
44r?  11n4 = 3? (b) 40 
5. (a) 802 + 3 + 7 + 5 
44r? 5304 64r? 78 x! 13. Consider w = x dy dz + y dz dx + z dx dy 
j ; 
(b) n° + 3-4 + et ta Compute that, dw = 3 dx dy dz, so that 
4° 5 
(c) Br + 10r? 49x34 a T Shon = sith dw = Ja dxdydz = v(R). 
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Review Exercises for Chapter 8 15. 87/3 
L (a) 27a? 17. xa2/4 
(b) 0 
19, 21 
3.0 


2L (a) Gis conservative; F is not. 
5. (a) f =x4/4—x2y3 


; 1 
(b) —1/4 (b) G=Vgifd = (x8 /A)+(y8/a)—Sx2y24 522 4C, 
where C is any constant. 
7. (a) Check that V x F = Q 1 
(b) f =3x2ycosz+C (c) fF ne 0; fe. 5) 
(c) 0 
fF ds= l fE ds= “5 
9. 23/6 b B 


IL No: V x (ax nr) =2a 23. Use(V -F)(Xo, Yo, 20) = 


; , imity- 0 Jf F. ndS from Section 8.4. 
13. (a) Vf = 3ye i+ 3xe2 j + 6xyze“ k (2p) J Joa, 


(b) 0 
(c) Both sides are 0. 
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Ann following a page reference indicates the information is found in a note. 


2 x 2 matrix, 31, 63 

3 x 3 matrix, 31-32, 63 

e's and 4's limits, 99-102 

In, 66 

n-dimensional Euclidian space, 60 
n-space vectors, 60-62 

R",2 

X axis, 1 

x coordinate, 1 

x-simple regions, 283, 287, 428, 430 
y axis, 1 

y coordinate, 2 

y-simple domain, 340 

y-simple regions, 283, 286, 287, 428-439 
z axis, 1 

z coordinate, 2 

0-form, 477 

1-form, 478 

2-form, 478-479 

3-form, 479-480 

symbols, xviii 


absolute maximum, 180, 192, 193 
absolute minimum, 180, 192, 193 
absolute value, xxiii 
acceleration, 217-228 
action, principle of, 166-168 
action at a distance, 243, 419 
additive inverse, 3 
adiabatic process, 375 
affine approximation, 108-109 
Alexandov, 417 
algebra of forms, 483-488 
al-K huwarizmi, xviii 
A mpère’s law, 372, 408, 452, 472 
analytic function, 166 
Andromeda galaxy, 419 
angle between two vectors, 22-23 
angular momentum, conservation of, 450 
angular velocity vector, 250 
anticommutativity, 483 
Apollonius of Perga, xv 
approximations, 158 
Arabian mathematics, xvii-xviii 
Archimedes, xvi, xix, 266, 333, 389 
arch length 
definition, 228, 231 
differential, 230-232 


formula justification, 232-234 
function, 232 
reparametrization, 234 
area 
curl as circulation per unit area, 445-448 
Green's theorem, 433-434 
surfaces, 383-392 
Argand, 46 
Aristarchus of Samus, xix 
Aristotle, xvi 
Ars Magna [the Great Art] (Cardano), 44 
associativity, 3, 46n1, 67, 483 
average value, 357 
average value of a function, 329-330 


Babylonian mathematics, xiii- xvii 
ball, volume of, 326 
basic 1-form, 478 
basic 2-form, 478-479 
basic 3-form, 479-480 
bearing, 30 
Bentley, Richard, 419 
Bernoulli, Jacob, 52 
Bernoulli, Johann I1, 155, 167, 358, 419 
best linear approximation, 110 
binormal vector, 235 
bonded function 
definition, 271 
integratability, 274 
bordered Hessian determinant, 197, 198, 199 
bound vectors, 6 
boundaries, 90-91 
boundary curve, 440 
boundary points, 90, 91 
boundary regions, 283 
bounded set, 180 
brachistrochrone, 358 
Brahe, Tycho, xx 
Bunyakovskii, 61n4 
Buys-Ballot’s law, 262 


c! 114 

calculus of variations, 358 
capped cylinder, 451 
Cardano, Gerolamo, 44, 45 
Cartesian coordinates, 1, 2 
Cartesian product, 263 
Catoptrica (Euclid), xv 
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Index 


Cauchy, A ugustin-Louis, 35, 45, 61n4, 
281, 390 
Cauchy-B unyakovskii- Schwarz (CBS) 
inequality, 61n4 
Cauchy-Riemann equations, 399 
Cauchy-Schwarz inequality, 23-25, 61 
Cavalieri, Bonaventura, 266 
Cavalieri’s principle, 265-266 
CBS (Cauchy-B unyakovskii- Schwarz) 
inequality, 61n4 
center of gravity, 399 
center of mass, 330-333 
centripetal force, 221 
chain rule 
described, 124, 126-127 
as differentiation rules, 218 
example, 153, 156 
first special case, 127-128 


implicit function theorem and surface, 206 


Lagrange multiplier method, 186 
second special case, 128-132 
Stokes’ theorems and, 441 
vector quantities and, 448 

change of variables formula 
applications, 329-338 
cylindrical coordinates, 324 
described, 307-308, 318-320 
double integrals, 319 
Gaussian integral, 322-323 
polar coordinates, 320-323 
spherical coordinates, 325-326 
triple integrals, 323-324 

change of variables theorem, 314-328 

changing the order of integration, 289-294 

charge density, 472 

chemical equation, 4 

circular orbit, 220-222 

circulation, 373, 446 

circulation and curl, 445-448 

class C ! functions, 114 

class C 2 functions, 150 

class C K, 237 

Clifford, W. K., 351, 419 

closed curve, 368 

closed interval, xxiii 

closed set, 180 

closed surface, 452 

Cobb-Douglas production function, 203 

coefficients, matrix of, 195n12 

commutative, 66 

complex numbers, 45-48 

component curves, 370 

component functions, 117 

components, 1, 4 

component scalar fields, 237 


composition, 99 
conductivity, 238 
cone, 379, 385, 392 
conformal parametrization, 399, 423 
conical refraction, 46 
conic sections, xiv 
conservation of angular momentum, 450 
conservation of energy, 240 
conservative fields 

definition, 453 

physical interpretation, 455-457 

planar case, 458-459 
conservative vector field, 453 
constant multiple rule, 125 
constant vector field, 491 
constrained extrema, 185-203 

LaGrange multiplier method for several 

constraints, 191-193 

second derivative test, 197-201 
continuity, 88-105 

of compositions, 99 

definition, 97 

open sets and, 88-90 

theorems, 113-114 
continuous functions, 95-98 
conversion of energy, 240 
coordinates, 1-2, 4 
Copernicus, Nicolaus, xvi, xix 
Coulomb's law, 239, 243, 409 
Cramer, 34 
Cramer's rule, 35 
Crick, Francis, 418 
critical points, 168, 177, 181, 182, 

186, 198 

cross product, 31, 35-39, 44, 47-48 
cross product rule, 218 
cross-sectional area, 265-266 
cross section of a torus, 391 
cubic equations, 44 
curl 

as circulation per unit area, 445-448 

definition, 249-250 

divergence, 253 

gradients, 252 

rotational flow, 251 

rotations and, 250-251 

scalar curl, 252-253 
curvature 

definition, 355 

hemisphere, 415-417 

on a path, 235 

planes, 415 

of surfaces, 414-417 

surfaces of constant, 417, 418 

total curvature, 414 


curves, 116-124 
components, 370 
integral of 1-forms over, 480-481 
knotted, 355 
line integrals over, 368-371 
piecewise, 229 
planar, 353-355 
total curvature, 355 
cyclicly permuting, 36 
cyclist, 374 
cycloid, 119 
cycloidal path, 121 
cylinder, 270, 451 
cylindrical coordinates 
change of variables, 324 
described, 52-54 
Stokes’ theorems, 448 
cylindrical hole, 392 


d'Alembert, Le Rond, 155 
DaVinci, Leonardo, 333 
definite integral, xxv 
degenerate critical point, 177 
degenerate type, 176 
Delaunay, 417 
del Ferro, Scipione, xix, 44 
del operator, 245, 256 
density 

charge, 472 

current, 472 
derivative of a function, xxiv 
derivative of a k-form, 484-485 
derivative operator, 245 
derivatives 

directional, 136-137 

gradients, 112-113 

partial, 105-108, 111 

properties of, 124-134 
Descartes, René, xix, 68 
determinants 

geometry of, 39-41 

matrix, 31-32 

properties of, 32-35, 66-67 
determinant test for positive definiteness, 175 
Dido, Queen of Carthage, 190 
Dieterici’s equation, 133 
differentiability 

functions of two variables, 109 

general case, 110-112 

tangent plane, 110 

theorems, 113-114 
differential equations, 154 
differential forms, 476-491 

0-form, 477 

1-form, 478 
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2-form, 478-479 
3-form, 479-480 
algebra of forms, 483-488 
cross product and, 48n3 
definition, 360 
Gauss’, 487 
Gauss’ theorem, 487, 488 
Green's theorem, 487 
integral of 1-forms over curves, 480-481 
integral of 2-forms over surfaces, 481-482 
integral of 3-forms over regions, 483 
Stokes’ theorems, 488 
differentiation, 105-116 
differentiation of paths, 217-219 
Dirac, Paul, xviii 
directed simple curve, 368 
directional derivatives, 136-137 
directions of fastest increase, 137-138 
Dirichlet’s functional, 399 
discontinuous functions, 97 
discriminant of the Hessian, 176 
displacement 
infinitesimal, 231 
vector, 27-29 
distance 
definition, xxiii 
from point to plane, 43-44 
between vector endpoints, 21-22, 62 
distributivity, 3, 483 
divergence 
curls, 253 
cylindrical coordinates, 448 
definition, 245 
Gauss’ theorem, 463-466, 467, 468, 470 
Green's theorem, 436-437 
Laplace operator, 254 
physical interpretation, 246 
spherical coordinates, 448, 470-471 
divergence-free, 468 
domain, xxiv 
dot product, 19-20, 24, 38, 44, 48, 55, 60, 245 
dot product rule, 218 
double helix, 418 
double integrals 
bonded function, 274 
Cavalieri’s principle, 265-266 
change of variables formula, 319 
changing the order of integration, 289-294 
Fubini’s theorem, 276-280 
mean value equality, 292-293 
mean value inequality, 292 
over a rectangle, 271-283 
over elementary regions, 283-289 
reduction to iterated integrals, 267-269, 
285-288 


536 


Index 


double integrals (Continued) 

as volumes, 263-265 
doughnut surface, 375 
dr notation for line integral, 371 


economics, 196-197 
Egyptian mathematics, xiii-xvii 
eigenvalue, 203 
eigenvector, 203 
Einstein, Albert, 243, 418-420 
Einstein’s field equations, 420, 422 
Einstein's general theory of relativity, 418 
elasticity, 155, 348 
electric field, 472 
Electromagnetic Theory (Heaviside), 49 
elementary regions 
described, 297-298 
double integrals over, 283-289 
Gauss’ theorem, 461-463 
Green's theorem, 428 
other types of, 299 
symmetric, 300 
triple integrals over, 298-302 
Elementary Treatises on Quaternions 
(Tait), 47 
Elements (Euclid), xv 
Elements of Vector Analysis (Gibbs), 49 
ellipsoid, 391, 413 
elliptic, 213 
endpoints, 368 
energy, conversion of, 240 
energy vector field, 238 
epicycles, xv, 119 
epicycloid, 119 
equality of mixed partials, 151, 152 
Equilibrium (Archimedes), 333 
equipotential surfaces, 141, 239 
escape velocity, 240-241 
Escher,M.C., 402 
Euclid, xv, xix, 236, 333 
Euclidian n-space 
matrices, 63-73 
vectors in, 60-62 
Eudoxus, xv 


Euler, Leonhard, 45, 48, 76n1, 149, 152, 155, 


187, 222, 390 
Euler equations, 152 
Euler's theorems, 146 
European mathematics, xviii- xxi 
exceptional points, 453 
exhausting regions, 340 
extrema of real-valued functions, 
166-185 
extreme points, 168 
extremum, 168 


Faraday’s law, 407, 449-450, 472 
Fary-M ilnor theorems, 356 
fence, Tom Sawyer's, 354 
Feynman, Richard, 222, 223-224, 427 
Feynman integrals, 223 
field concept, 242-243 
Fields medal, 356 
Fior, Antonio, 44 
first-order Taylor formula, 159, 160, 164 
flexural rigidity, 348 
flow lines, 241-242 
flux, 407, 408, 467-468 
flux per unit volume, 467-468 
Fontana, Nicolo, 44 
force fields 
gravitational, 238, 239, 240, 459 
work done by, 358-359 
force vectors, 29 
Fourier, Joseph, 154 
Fourier series, 154, 281 
free vectors, 6 
Frenet formulas, 235 
frequency, orbit, 221 
frustum, 392 
Fubini, Guido, 281 
Fubini’s theorem, 268, 271, 276-280, 
342-344 
functions 
analytic, 166 
arch length, 232 
average value, 329-330 
class C1, 114 
class C2, 150 
Cobb-Douglas production function, 203 
component, 117 
continuous, 95-98 
definition, xxiv 
differentiability, 109 
graphs, 77 
Green's, 475 
harmonic, 157 
mappings and, 76-77 
one-to-one, 310-311 
onto, 311-313 
potential, 455, 458, 475 
quadratic, 172 
of several variables, 76 
smooth, 193 
functions unbounded at isolated points, 
344-345 
fundamental solution, 156 
fundamental theorem of algebra, 45 
fundamental theorem of calculus, 159, 232, 
280, 430-431, 476 
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25. J cosh dx =sinhx 
26. J tanhx dx = log feosh | 
27. J cothx dx = log jinh x 


28. f sechxax = arctan (sinh x) 


x coshx +1 
29. | cschx dx = log tanh $| = 2 coshx 1 


30. J sinn? x dx = 7 sinh 2x = x 
3L f cost? x x = 7 sinh 2x + - 
32. f sech? xa = tanh x 

33. [sion Sax = xsinh? = — vx? +a? (a> 0) 

x cosh? - =i aa [cosh (=) >0,a > o] 


X 
3a [ cosh? © dx = 
a x cosh? Š + x2 — a? [cosh (>) <0a>0] 
a a 
35. tant? dx = x tanh © + $ og fa? — x? 
a a 2 
1 x 
36. | ——— dx =| Jaz + x2 = sinh? — 
| og (x + va? + x? = sin F (a > 0) 
a7. | : iec acon (a > 0) 
a? + x2 a a 
2 
sa f Va? ax = 5 a2 eti arcsin = (a > 0) 
2 23/2 Xie? 2 3af cin X 
39. / (a* —x’‘) dx = 3(5a — 2x*)Va2 = x24 3 arcsin = (a > 0) 


40. [een rn (a > 0) 


a +X 
a—xX 


1 X 
= | woe" = ae 
2 
a | x Ła?dx = ŽV bar S log |x + Vx? a 


a. | ax = log |x + vi 7 a2| = cosh? = (a > 0) 
1 1 
Sfi ato T a aox 
_ 3/2 
æ fx reaver 2(3bx ane sen) 


(Continued at the back of the book) 


Ja + bx 1 
47. =? a 
J 7 dx vaT +a | dn 
æ | X dx — 2(bx = 2alva + bx 
Ja +bx 3b2 
L gg MOA a a 
1 Ja Ja + bx + ./a 
e. |== 
AVTE = arctan cay (a <0) 
Ga = 
so. f Pi dx = ya? — x? a log $ a 


5L [=E = -5e R 


4 
S2 feya x dx = Ea — a2) Va? 4S arcsin = (a > 0) 
2_ y2 

s | 1 sie Bigg me X 

XVa? — x2 a X 

x 
54. dx = a? — x2 
l= 

s | x dx = X a Tr a (a >0) 

Jaz — x2 — 2 2 a 


JX + a a + VX? + a? 
se | 7 dx = y x? +a? — a log 7 


/y2 _ 32 
s. | — : dx = yx? — a? — a arccos (a > 0) 


Ix] 
xvx? +a 


X? a? ae 
s -p= 5 = log (x + x? +a?) 


1 
59, dx = 
I; x2 +a? 


X 
a + Vx? +a? 


a 
arccos (a > 0) 


1 7 X 
a f Sas tx = In] Vee Eat + = 
1 2ax +b — Vb? — 4ac 2 
log (b* > 4ac) 
1 b? — 4ac 2ax +b + vb? — 4ac 
a | oe 2 2ax +b 
Scary ae arctan (b? < 4ac) 


„/4ac — b? /4ac — b? 


X 1 2 b 1 
af == OJJA Oe sf sec 
Gz 09 ax +b +25 Vax + bx +e (a > 0) 


1 
a |i 
L ax +bx +c i A nie (a <0) 
/—a ~b? — 4ac 


2ax +b 4ac — b? 1 
66. | Vax? bx + cdx = ax2+bx +c + / dx 
4a 8a ax?+b+c 


e. | X vax? +bx +c a 1 ay 
"J sax +bx+c a 2a J Vax: +bx +C 
—1 2./CVax? + bx +¢+ bx + 2c 
log (c > 0) 
ne 
xvax?+bx+c¢  ) 1 ee bx + 2c wei 
a — < 
fe |x| /b2 — 4ac 
69. TERCET: = (5x? - 5%) (a2 + x2)3 
Vx? a? = /(x? £a2)3 
70. f dx = 
x4 3a2x3 
= sin(a—b)x  sin(a+b)x 3,2 
7L f sinaxsinbx dx = a —b) Ma +b) (af Æ bf) 
, cos(a —b)x cos(a +b)x 
72. = 2 2 
J sinax cosbx dx Zla —b) Ma +b) (af Æ b‘) 
_ sin(a—b)x  sin(a+b)x 2,2 
73. [ cosax cosbx dx = Haad EN (at Æ b‘) 
74. J secxtanx dx = secx 
75. J escxcotx dx = ~cscx 
m-1 y ejpn+1 _ 
76. [cost xsin x dx = Aan R ~ | cos”? sin'x dx 
m+n m+n 
inn-l m+1 _ 
E B a A = f cos" xsn?xdx 
m+n m+n 


; 1 n 
77. [x sinaxax = Sx cosax +2 [xt cosax dx 
n 1 Ne} n n—-le 
78. | X cosax dx = =X sinax — z x" sinax dx 


x'e? n 
79. fred a mf xe dx 


logax 1 
n — yn+1 
a fx logax dx = x oa ai | 


n+1 


n m — X m m n m=1 
8L f (logax)”dx = rp f (logax)™=* dx 


e?*(a sinbx — b cosbx) 
a? + b? 

e**(b sinbx + a cosbx) 
a? + b? 

34. J sechxtanhx dx = —sechx 


82. fe sinbx dx = 


83. j= cosbx dx = 


85. J cschx coth x dx = —cschx 


SYMBOLS INDEX 
SYMBOLS ARE LISTED IN ORDER OF THEIR APPEARANCE IN THE TEXT 


SYMBOL 


R 
[a, b] 
(a, b) 
[a, b) 
(a, b] 
ja] 
Q 

R” 

i, j, k 
ilal 
a-b 
axb 


fda = I f(x, y) dx dy 


fr! = fff tty 2vexayae 


NAME 


real numbers 

closed interval {x |a <x <b} 

open interval {x |a <x <b} 
half-open interval {x |a <x <b} 
half-open interval {x |a <x <b} 
absolute value of a 

rational numbers 

n-dimensional space 

standard basis in R? 

norm of a vector a 

inner product of the vectors aand b 
cross product of the vectors aand b 
cylindrical coordinates 

spherical coordinates 

disk of radius r about X 

limit as xapproaches X 


left-hand limit; x > b from below 
partial derivative of f with respect to x 


derivative of f at the point X 

grad f, gradient of the function f 
continuously differentiable 

a path 

twice continuously differentiable 
Hessian of f at the point x 

del or nabla 

div F, divergence of F 

curl F, curl of F 

Laplacian 


double integral 


triple integral 


J acobian 
opposite path 


path integral 
line integral 
scalar surface integral 


vector surface integral 


